TITLE OF THE INVENTION

SYNTHETIC GENE ENCODING HUMAN CARCINOEMBRYONIC ANTIGEN AND USES THEREOF

FIELD OF THE INVENTION

The present invention relates generally to the therapy of cancer. More specifically, the
present invention relates to synthetic polynucleotides encoding the human tumor-associated polypeptide
carcinoembryonic antigen, herein designated hCEAopt, wherein the polynucleotides are codon-optimized
for expression in a human cellular environment. The present invention also provides recombinant vectors
and hosts comprising said synthetic polynucleotides. This invention also relates to adenoviral vector and
plasmid constructs carrying hCEAopt and to their use in vaccines and pharmaceutical compositions for
preventing and treating cancer.

BACKGROUND OF THE INVENTION 

The immunoglobulin superfamily (IgSF) consists of numerous genes that code for
proteins with diverse functions, one of which is intercellular adhesion. IgSF proteins contain at least one
Ig-related domain that is important for maintaining proper intermolecular binding interactions. Because
such interactions are necessary to the diverse biological functions of the IgSF members, disruption or
aberrant expression of many IgSF adhesion molecules has been correlated with many human diseases.

The carcinoembryonic antigen (CEA) belongs to a subfamily of the Ig superfamily
consisting of cell surface glycoproteins. Members of the CEA subfamily are known as CEA-related cell
adhesion molecules (CEACAMs). In recent scientific literature, the CEA gene has been renamed
CEACAM5, although the nomenclature for the protein remains CEA. Functionally, CEACAMs have
been shown to act as both homotypic and heterotypic intercellular adhesion molecules (Benchimol et al.,
Cell 57: 327-334 (1989)). In addition to cell adhesion, CEA inhibits cell death resulting from detachment
of cells from the extracellular matrix and can contribute to cellular transformation associated with certain
proto-oncogenes such as Bcl2 and C-Myc (see Berinstein, J. Clin. Oncol. 20(8): 2197-2207 (2002)).

Normal expression of CEA has been detected during fetal development and in adult
colonic mucosa. CEA overexpression was first detected in human colon tumors over thirty years ago
(Gold and Freedman, J. Exp. Med. 121:439-462 (1965)) and has since been found in nearly all colorectal
tumors. Additionally, CEA overexpression is detectable in a high percentage of adenocarcinomas of the
pancreas, breast and lung. Because of the prevalence of CEA expression in these tumor types, CEA is
widely used clinically in the management and prognosis of these cancers.

Sequences coding for human CEA have been cloned and characterized (U.S. Patent No.
5,274,087; U.S. Patent No. 5,571,710; and U.S. Patent No. 5,843,761. See also Beauchemin et al., Mol.
Cell. Biol. 7:3221-3230 (1987); Zimmerman et al., Proc. Natl. Acad. Sci. USA 84:920-924 (1987);
Thompson et al., Proc. Natl. Acad. Sci. USA 84(9):2965-69 (1987)).

The correlation between CEA expression and metastatic growth has led to its
identification as a target for molecular and immunological intervention for colorectal cancer treatment.
One therapeutic approach targeting CEA is the use of anti-CEA antibodies (see Chester et al., Cancer
Chemother. Pharmacol. 46 (Suppl): S8-S12 (2000)), while another is to activate the immune system to
attack CEA-expressing tumors using CEA-based vaccines (for review, see Berinstein, supra).

The development and commercialization of many vaccines have been hindered by
difficulties associated with obtaining high expression levels of exogenous genes in successfully
transformed host organisms. Therefore, despite the identification of the wild-type nucleotide sequences
encoding CEA proteins described above, it would be highly desirable to develop a readily renewable
source of human CEA protein that utilizes CEA-encoding nucleotide sequences that are optimized for
expression in the intended host cell, said source allowing for the development of a cancer vaccine which
is efficacious and not hindered by self-tolerance.

SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods to elicit or enhance immunity
to the protein products expressed by CEA genes, which have been associated with numerous
adenocarcinomas, including colorectal carcinomas. Specifically, the present invention provides
polynucleotides encoding human CEA protein, wherein said polynucleotides are codon-optimized for
high-level expression in a human cell. The present invention further provides adenoviral and plasmid-based
vectors comprising the synthetic polynucleotides and discloses use of said vectors in immunogenic
compositions and vaccines for the prevention and/or treatment of CEA-associated cancer.

The present invention also relates to synthetic nucleic acid molecules (polynucleotides)
comprising a sequence of nucleotides that encode human carcinoembryonic antigen (hereinafter hCEA)
as set forth in SEQ ID NO:2, wherein the synthetic nucleic acid molecules are codon-optimized for
high-level expression in a human cell (hereinafter hCEAopt). The nucleic acid molecules disclosed herein may
be transfected into a host cell of choice wherein the recombinant host cell provides a source for
substantial levels of an expressed functional hCEA protein (SEQ ID NO:2).

The present invention further relates to a synthetic nucleic acid molecule which encodes
mRNA that expresses a human CEA protein; this DNA molecule comprising the nucleotide sequence
disclosed herein as SEQ ID NO:1. A preferred aspect of this portion of the present invention is disclosed
in FIGURE 1, which shows a DNA molecule (SEQ ID NO:1) that encodes a hCEA protein (SEQ ID
NO:2 or SEQ ID NO:16). The preferred nucleic acid molecule of the present invention is
codon-optimized for high-level expression in a human cell.

Another preferred DNA molecule of the present invention comprises a sequence of
nucleotides that encodes a human CEA that is deleted of its C-terminal anchoring domain (AD), which is
located from about amino acid 679 to about amino acid 702 of the human full-length CEA (SEQ ID
NO:2), wherein said sequence of nucleotides is codon-optimized for high-level expression in a human
cell. An exemplary DNA molecule encoding a CEA variant that is truncated of its anchoring domain is
set forth in SEQ ID NO:15 (shown in FIGURE 10A). The corresponding amino acid sequence of
hCEA-ΔAD is set forth in SEQ ID NO:16 (shown in FIGURE 10B).

The present invention also relates to recombinant vectors and recombinant host cells,
both prokaryotic and eukaryotic, which contain the nucleic acid molecules disclosed throughout this
specification.

The present invention further relates to a process for expressing a codon-optimized
human CEA protein in a recombinant host cell, comprising: (a) introducing a vector comprising a nucleic
acid molecule as set forth in SEQ ID NO:1 or SEQ ID NO:15 into a suitable host cell; and (b) culturing
the host cell under conditions which allow expression of said codon-optimized human protein.

Another aspect of this invention is a method of preventing or treating cancer comprising
administering to a mammal a vaccine vector comprising a synthetic nucleic acid molecule, the synthetic
nucleic acid molecule comprising a sequence of nucleotides that encodes a human carcinoembryonic
antigen (hCEA) protein as set forth in SEQ ID NO:2 or SEQ ID NO:16, wherein the synthetic nucleic
acid molecule is codon-optimized for high-level expression in a human cell.

The present invention further relates to an adenovirus vaccine vector comprising an
adenoviral genome with a deletion in the E1 region, and an insert in the E1 region, wherein the insert
comprises an expression cassette comprising: (a) a codon-optimized polynucleotide encoding a human
CEA protein; and (b) a promoter operably linked to the polynucleotide.

The present invention also relates to a vaccine plasmid comprising a plasmid portion and
an expression cassette portion, the expression cassette portion comprising: (a) a synthetic polynucleotide
encoding a human CEA protein, wherein the synthetic polynucleotide is codon-optimized for optimal
expression in a human cell; and (b) a promoter operably linked to the polynucleotide.

Another aspect of the present invention is a method of protecting or treating a mammal
from cancer or treating a mammal suffering from CEA-associated cancer comprising: (a) introducing into
the mammal a first vector comprising: i) a codon-optimized polynucleotide encoding a human CEA
protein; and ii) a promoter operably linked to the polynucleotide; (b) allowing a predetermined amount of
time to pass; and (c) introducing into the mammal a second vector comprising: i) a codon-optimized
polynucleotide encoding a human CEA protein; and ii) a promoter operably linked to the polynucleotide.

As used throughout the specification and in the appended claims, the singular forms "a,"
"an," and "the" include the plural reference unless the context clearly dictates otherwise.

As used throughout the specification and appended claims, the following definitions and 
abbreviations apply: 

The term "promoter" refers to a recognition site on a DNA strand to which the RNA
polymerase binds. The promoter forms an initiation complex with RNA polymerase to initiate and drive
transcriptional activity. The complex can be modified by activating sequences termed "enhancers" or
inhibiting sequences termed "silencers."

The term "cassette" refers to the sequence of the present invention that contains the
nucleic acid sequence which is to be expressed. The cassette is similar in concept to a cassette tape; each
cassette has its own sequence. Thus by interchanging the cassette, the vector will express a different
sequence. Because of the restriction sites at the 5' and 3' ends, the cassette can be easily inserted,
removed or replaced with another cassette.

The term "vector" refers to some means by which DNA fragments can be introduced into
a host organism or host tissue. There are various types of vectors including plasmids, viruses (including
adenovirus), bacteriophages and cosmids.

The term "first generation," as used in reference to adenoviral vectors, describes said
adenoviral vectors that are replication-defective. First generation adenovirus vectors typically have a
deleted or inactivated E1 gene region, and preferably have a deleted or inactivated E3 gene region.

The designation "pV1J/hCEAopt" refers to a plasmid construct disclosed herein
comprising the human CMV immediate-early (IE) promoter with intron A, a full-length codon-optimized
human CEA gene, bovine growth hormone-derived polyadenylation and transcriptional termination
sequences, and a minimal pUC backbone (see EXAMPLE 2). The designation "pV1J/hCEA" refers to a
construct as described above, except the construct comprises a wild-type human CEA gene instead of a
codon-optimized human CEA gene.

The designations "MRKAd5/hCEAopt" and "MRKAd5/hCEA" refer to two constructs,
disclosed herein, which comprise an Ad5 adenoviral genome deleted of the E1 and E3 regions. In the
"MRKAd5/hCEAopt" construct, the E1 region is replaced by a codon-optimized human CEA gene in an
E1 parallel orientation under the control of a human CMV promoter without intron A, followed by a
bovine growth hormone polyadenylation signal. The "MRKAd5/hCEA" construct is essentially as
described above, except the E1 region of the Ad5 genome is replaced with a wild-type human CEA
sequence (see EXAMPLE 2).

The term "effective amount" means sufficient vaccine composition is introduced to
produce the adequate levels of the polypeptide, so that an immune response results. One skilled in the art
recognizes that this level may vary.

A "conservative amino acid substitution" refers to the replacement of one amino acid
residue by another, chemically similar amino acid residue. Examples of such conservative substitutions
are: substitution of one hydrophobic residue (isoleucine, leucine, valine, or methionine) for another;
substitution of one polar residue for another polar residue of the same charge (e.g., arginine for lysine;
glutamic acid for aspartic acid).

"hCEA" and "hCEAopt" refer to a human carcinoembryonic antigen and a human
codon-optimized carcinoembryonic antigen, respectively.

The term "hCEA-ΔAD" refers to a variant of human CEA that is deleted of its C-terminal
anchoring domain (AD), which is located from about amino acid 679 to about amino acid 702 of the
human full-length CEA (SEQ ID NO:2). Nucleotide sequences encoding hCEA-ΔAD of the present
invention are codon-optimized for high-level expression in a human cellular environment (designated
herein "hCEAopt-ΔAD"). An exemplary DNA molecule encoding a CEA variant that is truncated of its
anchoring domain is set forth in SEQ ID NO:15 (shown in FIGURE 10A). The corresponding amino acid
sequence of hCEA-ΔAD is set forth in SEQ ID NO:16 (shown in FIGURE 10B). Nucleotides encoding
hCEA-ΔAD are useful for the development of a cancer vaccine for treatment and/or prophylaxis of
cancer.
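
By way of illustration only, the deletion of a C-terminal region from a polypeptide sequence can be
sketched in Python as follows. The function name delete_region and the 702-residue placeholder string
are hypothetical and stand in for the sequences of the sequence listing, which are not reproduced
here; the boundaries 679 and 702 are the approximate positions stated above.

```python
def delete_region(protein: str, start: int, end: int) -> str:
    """Return `protein` without residues start..end (1-based, inclusive)."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("region must lie within the sequence")
    return protein[:start - 1] + protein[end:]


# Placeholder only: 678 'A' residues followed by a 24-residue 'G' tail.
# This is NOT the sequence of SEQ ID NO:2; it merely has the same length (702).
FULL_LENGTH_PLACEHOLDER = "A" * 678 + "G" * 24

# Remove the region from about residue 679 to about residue 702.
TRUNCATED_PLACEHOLDER = delete_region(FULL_LENGTH_PLACEHOLDER, 679, 702)
```

Because the deleted region extends to the last residue, the operation in this sketch amounts to a
C-terminal truncation that leaves the first 678 residues unchanged.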

The term "mammalian" refers to any mammal, including a human being.

The abbreviation "Ag" refers to an antigen.

The abbreviations "Ab" and "mAb" refer to an antibody and a monoclonal antibody,
respectively.

The abbreviation "ORF" refers to the open reading frame of a gene.

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows the nucleotide sequence of wild-type human CEA cDNA (SEQ ID
NO:3) and of the codon-optimized clone (hCEAopt, SEQ ID NO:1). The deduced amino acid sequence is
shown on top (SEQ ID NO:2). The substituted nucleotides of the synthetic codon-optimized cDNA are
shown below the hCEA cDNA sequence. See EXAMPLE 2.

FIGURE 2 shows the expression of hCEA in injected mice. Groups of 10 C57BL/6 mice
were injected in the quadriceps muscle either with various doses of MRKAd5-hCEA and
MRKAd5-hCEAopt (Panel A) or with 25 or 50 micrograms of plasmids pV1J/hCEA and pV1J/hCEAopt (Panel B).
Blood samples were collected 3 days postinjection and CEA levels were measured. Filled triangles
represent CEA measurement of individual mice. Geometric mean values are also shown (filled circle).

FIGURE 3 shows that codon optimization increases the immune response to human
CEA. Groups of 8 C57BL/6 mice were injected via the quadriceps muscle with various doses of either
MRKAd5-hCEA or MRKAd5-hCEAopt. Virus injections were carried out at 0 and 21 days. Panel A.
At two weeks post boosting injection, the number of CD8+ IFNγ-secreting T cells specific for hCEA was
determined by ELISPOT assay on splenocytes from individual mice (filled triangles) using peptide 143,
which covers aa 569-583 and includes a CD8+ epitope. Two different amounts of splenocytes (2.5 x 10^5
and 5 x 10^5) and two replicates of each amount were tested. Average values were calculated by
subtracting the background level determined in the absence of peptides (typically less than 10 SFC/10^6
total splenocytes), and the results were expressed as the number of SFC/10^6 total splenocytes. Values
from individual mice are shown (filled triangles) as well as the geometric mean values (filled circle).
Panel B. Anti-CEA antibody titers in sera from individual mice (filled triangles) were measured using 10
days post boost serum samples. Geometric mean titers (GMT) (filled circles) are also shown.
Ad/hCEAopt is significantly different from Ad/hCEA.

FIGURE 4. Comparison of different immunization regimens. Groups of C57BL/6 (A) or
BALB/c (B) mice were immunized with different combinations of plasmid pV1J/hCEA (50 μg/dose
electroinjected in the quadriceps muscle) and MRKAd5/hCEA (1x10^9 pp/dose). The number of
IFNγ-secreting T cells in splenocytes in each individual mouse was determined using a pool of peptides
covering aa 497-703 (pool D) as described in materials and methods and in the legend to FIGURE 3.
Geometric mean values are also shown (filled circles). D/D and D/A are significantly different from
Ad/Ad group in C57BL/6 mice. All three groups are significantly different in BALB/c mice.

FIGURE 5 shows the results of mapping of T-cell responses to selected regions of the
hCEA protein. Groups of C57BL/6 (Panel A) or BALB/c (Panel B) mice were immunized with 50 μg of
plasmid pV1J/hCEA and boosted three weeks later with 1x10^9 pp of Ad/hCEA. The number of
IFNγ-secreting T cells in splenocytes in each individual mouse was determined two weeks post-boost using
pools of peptides covering the entire protein as described in materials and methods and in the legend to
FIGURE 3. Geometric mean values are also shown (filled circles).

FIGURE 6. Identification of immunoresponsive peptides of hCEA. Pooled splenocytes
from 4 immunized C57BL/6 (Panel A) or BALB/c (Panel B) mice were assayed for IFNγ secretion
against each indicated peptide by ELISPOT assay (see EXAMPLE 8).

FIGURE 7 shows the sequence of epitope-containing peptides for CEA in C57BL/6 mice
(Panel A) and BALB/c mice (Panel B) (see EXAMPLE xx). Listed to the right are the percent of
IFNγ-producing CD8+ (CD4+) CD3+ cells.

FIGURE 8 shows results from an IFNγ-ELISPOT assay of immunized CEA transgenic
mice as described in EXAMPLE 9. Mice were immunized with four electroinjections of plasmid DNA
one week apart plus one adenovirus injection. For each immunogen, data were obtained with pooled
splenocytes of three injected mice. The CD8-specific response was measured using peptide 143.

FIGURE 9 shows IFNγ intracellular staining of immunized CEA.tg mice. Mice were
immunized with 2 injections of 1x10^10 vp of adenovirus two weeks apart. Shown are the data obtained
with pooled splenocytes of three injected mice. Listed to the right are the percent of CD8+ or CD4+
cells.

FIGURE 10, Panel A, shows an exemplary codon-optimized DNA molecule encoding a
CEA variant that is truncated of its anchoring domain as set forth in SEQ ID NO:15. The corresponding
amino acid sequence of hCEA-ΔAD is shown in Panel B (SEQ ID NO:16).

DETAILED DESCRIPTION OF THE INVENTION 

Carcinoembryonic antigen (CEA) is commonly associated with the development of
adenocarcinomas. The present invention relates to compositions and methods to elicit or enhance
immunity to the protein product expressed by the CEA tumor-associated antigen, wherein aberrant CEA
expression is associated with the carcinoma or its development. Association of aberrant CEA expression
with a carcinoma does not require that the CEA protein be expressed in tumor tissue at all timepoints of
its development, as abnormal CEA expression may be present at tumor initiation and not be detectable
late into tumor progression or vice-versa.

To this end, synthetic DNA molecules encoding the human CEA protein are provided.
The codons of the synthetic molecules are designed so as to use the codons preferred by the projected
host cell, which in preferred embodiments is a human cell. The synthetic molecules may be used for the
development of recombinant adenovirus or plasmid-based vaccines, which provide effective
immunoprophylaxis against CEA-associated cancer through neutralizing antibody and cell-mediated
immunity. The synthetic molecules may be used as an immunogenic composition. This invention
provides polynucleotides which, when directly introduced into a vertebrate in vivo, including mammals
such as primates and humans, induce the expression of encoded proteins within the animal.

The wild-type human CEA nucleotide sequence has been reported (see, e.g., U.S. Patent
No. 5,274,087; U.S. Patent No. 5,571,710; and U.S. Patent No. 5,843,761). The present invention provides
synthetic DNA molecules encoding the human CEA protein. The synthetic molecules of the present
invention comprise a sequence of nucleotides, wherein some of the nucleotides have been altered so as to
use the codons preferred by a human cell, thus allowing for high-level expression of CEA in a human host
cell. The synthetic molecules may be used as a source of CEA protein, which may be used in a cancer
vaccine to provide effective immunoprophylaxis against CEA-associated carcinomas through neutralizing
antibody and cell-mediated immunity.

A "triplet" codon of four possible nucleotide bases can exist in 64 variant forms.
Because these codons provide the message for only 20 different amino acids (as well as translation
initiation and termination), some amino acids can be coded for by more than one codon, a phenomenon
known as codon redundancy. For reasons not completely understood, alternative codons are not
uniformly present in the endogenous DNA of differing types of cells. Indeed, there appears to exist a
variable natural hierarchy or "preference" for certain codons in certain types of cells. As one example,
the amino acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CTT, TTA,
and TTG. Exhaustive analysis of genome codon frequencies for microorganisms has revealed that the
endogenous DNA of E. coli most commonly contains the CTG leucine-specifying codon, while the DNA
of yeasts and slime molds most commonly includes a TTA leucine-specifying codon. In view of this
hierarchy, it is generally believed that the likelihood of obtaining high levels of expression of a
leucine-rich polypeptide by an E. coli host will depend to some extent on the frequency of codon use. For
example, it is likely that a gene rich in TTA codons will be poorly expressed in E. coli, whereas a
CTG-rich gene will probably be highly expressed in this host. Similarly, a preferred codon for expression
of a leucine-rich polypeptide in yeast host cells would be TTA.
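
The codon redundancy described above can be illustrated with a short Python sketch. It builds the
standard genetic code from the conventional compact listing (bases ordered T, C, A, G) and groups
the 64 codons by the amino acid they specify. The names CODON_TABLE, synonymous_codons and SYNONYMS
are chosen here for illustration and do not appear elsewhere in this specification.

```python
from collections import defaultdict
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# '*' marks a termination codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every triplet of the four bases (4**3 = 64 codons) to its amino acid.
CODON_TABLE = {
    "".join(triplet): aa
    for triplet, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(table):
    """Group codons by encoded amino acid (or '*' for termination)."""
    groups = defaultdict(list)
    for codon, aa in table.items():
        groups[aa].append(codon)
    return {aa: sorted(codons) for aa, codons in groups.items()}


SYNONYMS = synonymous_codons(CODON_TABLE)

# Leucine ('L') is specified by six codons; methionine ('M') by only one.
print(SYNONYMS["L"])
print(SYNONYMS["M"])
```

Because several codons map to the same amino acid, the same protein can be written with different
nucleotide sequences, which is the degree of freedom that codon optimization exploits.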

The implications of codon preference phenomena on recombinant DNA techniques are
manifest, and the phenomenon may serve to explain many prior failures to achieve high expression levels
of exogenous genes in successfully transformed host organisms: a less "preferred" codon may be
repeatedly present in the inserted gene and the host cell machinery for expression may not operate as
efficiently. This phenomenon suggests that synthetic genes which have been designed to include a
projected host cell's preferred codons provide an optimal form of foreign genetic material for practice of
recombinant DNA techniques. Thus, one aspect of this invention is a human CEA gene that is
codon-optimized for expression in a human cell. In a preferred embodiment of this invention, it has been found
that the use of alternative codons encoding the same protein sequence may remove the constraints on
expression of exogenous CEA protein in human cells.

In accordance with this invention, the human CEA gene sequence was converted to a
polynucleotide sequence having an identical translated sequence but with alternative codon usage as
described by Lathe, "Synthetic Oligonucleotide Probes Deduced from Amino Acid Sequence Data:
Theoretical and Practical Considerations" J. Molec. Biol. 183:1-12 (1985), which is hereby incorporated
by reference. The methodology generally consists of identifying codons in the wild-type sequence that
are not commonly associated with highly expressed human genes and replacing them with optimal codons
for high expression in human cells. The new gene sequence is then inspected for undesired sequences
generated by these codon replacements (e.g., "ATTTA" sequences, inadvertent creation of intron splice
recognition sites, unwanted restriction enzyme sites, etc.). Undesirable sequences are eliminated by
substitution of the existing codons with different codons coding for the same amino acid. The synthetic
gene segments are then tested for improved expression.
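
The two design steps described above (codon replacement followed by removal of undesired sequences)
can be sketched in Python as follows. This is a minimal illustration under stated assumptions and is
not the procedure actually used to generate SEQ ID NO:1. The ranked preferences in PREFERRED cover
only four amino acids and are illustrative, "GGATCC" is chosen arbitrarily as an example of an
unwanted restriction site, and the function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(t): aa for t, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Assumption: illustrative ranked codon preferences (most preferred first).
PREFERRED = {
    "L": ["CTG", "CTC"],
    "I": ["ATC", "ATT"],
    "P": ["CCC", "CCT"],
    "F": ["TTC", "TTT"],
}
# Undesired motifs: "ATTTA" is named in the text; "GGATCC" is an assumed example site.
UNDESIRED = ("ATTTA", "GGATCC")


def translate(dna):
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def count_undesired(seq):
    return sum(seq.count(motif) for motif in UNDESIRED)


def first_hit(seq):
    hits = [(seq.find(motif), motif) for motif in UNDESIRED if motif in seq]
    return min(hits) if hits else None


def break_motif(codons, pos, motif):
    """Swap one overlapping codon for a synonym so that fewer motifs remain."""
    before = count_undesired("".join(codons))
    for idx in range(pos // 3, (pos + len(motif) - 1) // 3 + 1):
        for alt in PREFERRED.get(CODON_TABLE[codons[idx]], []):
            trial = codons[:idx] + [alt] + codons[idx + 1:]
            if count_undesired("".join(trial)) < before:
                return trial
    raise ValueError(f"cannot remove {motif!r} with the available synonyms")


def optimize(dna):
    if len(dna) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    # Step 1: use the most preferred synonym where one is listed; otherwise keep the codon.
    out = [PREFERRED.get(CODON_TABLE[c], [c])[0] for c in codons]
    # Step 2: eliminate undesired sequences created by the replacements.
    hit = first_hit("".join(out))
    while hit is not None:
        out = break_motif(out, *hit)
        hit = first_hit("".join(out))
    return "".join(out)


WILD_TYPE_TOY = "TGGATACCATTA"  # toy sequence encoding Trp-Ile-Pro-Leu, not a CEA sequence
OPTIMIZED_TOY = optimize(WILD_TYPE_TOY)
```

In this toy case the first step yields TGG ATC CCC CTG, which happens to contain GGATCC across the
codon boundaries; the second step replaces ATC with the synonymous ATT, so the translated sequence is
unchanged. The final step named in the text, testing for improved expression, is experimental and is
outside the scope of the sketch.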

The methods described above were used to create synthetic gene sequences for human
CEA, resulting in a gene comprising codons optimized for high-level expression. While the above
procedure provides a summary of our methodology for designing codon-optimized genes for use in
cancer vaccines, it is understood by one skilled in the art that similar vaccine efficacy or increased
expression of genes may be achieved by minor variations in the procedure or by minor variations in the
sequence. One of skill in the art will also recognize that additional DNA molecules may be constructed
that provide for high levels of CEA expression in human cells, wherein only a portion of the codons of
the DNA molecules are codon-optimized.

Accordingly, the present invention relates to a synthetic polynucleotide comprising a
sequence of nucleotides encoding a human CEA protein (SEQ ID NO:2), or a biologically active
fragment or mutant form of a human CEA protein, including, but not limited to, hCEA-ΔAD (SEQ ID
NO:16), the polynucleotide sequence comprising codons optimized for expression in a human host. Said
mutant forms of the CEA protein include, but are not limited to: conservative amino acid substitutions,
amino-terminal truncations, carboxy-terminal truncations, deletions, or additions, collectively referred to
herein as "variants". Any such biologically active fragment and/or mutant will encode either a protein or
protein fragment which at least substantially mimics the immunological properties of the CEA protein as
set forth in SEQ ID NO:2. The synthetic polynucleotides of the present invention encode mRNA
molecules that express a functional human CEA protein so as to be useful in the development of a
therapeutic or prophylactic cancer vaccine.

As stated above, the present invention relates to nucleotides encoding a human CEA
protein (SEQ ID NO:2), or a biologically active fragment or mutant form thereof. To this end, the present
invention provides nucleotides encoding hCEA-ΔAD (SEQ ID NO:16, FIGURE 10B), which comprises a
human CEA protein that is deleted of its C-terminal anchoring sequence. The nucleic acid molecules of
the present invention encoding hCEA-ΔAD are codon-optimized for enhanced expression in human cells.
An exemplary nucleic acid molecule encoding hCEA-ΔAD comprises a sequence of nucleotides as set
forth in SEQ ID NO:15 (FIGURE 10A).

The present invention relates to a synthetic nucleic acid molecule (polynucleotide)
comprising a sequence of nucleotides which encodes mRNA that expresses a novel hCEA protein as set
forth in SEQ ID NO:2, wherein the synthetic nucleic acid molecule is codon-optimized for high-level
expression in a human host cell. The nucleic acid molecules of the present invention are substantially
free from other nucleic acids.

The present invention also relates to recombinant vectors and recombinant host cells,
both prokaryotic and eukaryotic, which contain the nucleic acid molecules disclosed throughout this
specification. The synthetic DNA molecules, associated vectors, and hosts of the present invention are
useful for the development of a cancer vaccine.

A preferred DNA molecule of the present invention comprises the nucleotide sequence
disclosed herein as SEQ ID NO:1, shown in FIGURE 1, which encodes the human CEA protein shown in
FIGURE 1 and set forth as SEQ ID NO:2.

A further preferred DNA molecule of the present invention comprises the nucleotide
sequence disclosed herein as SEQ ID NO:15, shown in FIGURE 10A, which encodes a human CEA
variant that is deleted of its C-terminal anchoring sequence, as set forth in SEQ ID NO:16, and shown in
FIGURE 10B.

The present invention also includes biologically active fragments or mutants of SEQ ID
NO:1, which encode mRNA expressing human CEA proteins. Any such biologically active fragment
and/or mutant will encode either a protein or protein fragment which at least substantially mimics the
pharmacological properties of the hCEA protein, including but not limited to the hCEA protein as set
forth in SEQ ID NO:2. Any such polynucleotide includes but is not necessarily limited to: nucleotide
substitutions, deletions, additions, amino-terminal truncations and carboxy-terminal truncations. The
mutations of the present invention encode mRNA molecules that express a functional hCEA protein in a
eukaryotic cell so as to be useful in cancer vaccine development.

This invention also relates to synthetic codon-optimized DNA molecules that encode the
hCEA protein wherein the nucleotide sequence of the synthetic DNA differs significantly from the
nucleotide sequence of SEQ ID NO:1, but still encodes the hCEA protein as set forth in SEQ ID NO:2.
Such synthetic DNAs are intended to be within the scope of the present invention. Therefore, the present
invention discloses codon redundancy that may result in numerous DNA molecules expressing an
identical protein. Also included within the scope of this invention are mutations in the DNA sequence
that do not substantially alter the ultimate physical properties of the expressed protein. For example,
substitution of valine for leucine, arginine for lysine, or asparagine for glutamine may not cause a change
in the functionality of the polypeptide.

It is known that DNA sequences coding for a peptide may be altered so as to code for a
peptide that has properties that are different from those of the naturally occurring peptide. Methods of
altering the DNA sequences include but are not limited to site-directed mutagenesis. Examples of altered
properties include but are not limited to changes in the affinity of an enzyme for a substrate or a receptor
for a ligand.

The present invention also relates to hCEAopt fusion constructs, including but not limited
to fusion constructs which express a portion of the human CEA protein linked to various markers,
including but in no way limited to GFP (green fluorescent protein), the MYC epitope, GST, and Fc. Any
such fusion construct may be expressed in the cell line of interest and used to screen for modulators of the
human CEA protein disclosed herein. Also contemplated are fusion constructs that are constructed to
enhance the immune response to human CEA including, but not limited to: DOM, hsp70, and LTB.

The present invention further relates to recombinant vectors that comprise the synthetic
nucleic acid molecules disclosed throughout this specification. These vectors may be comprised of DNA
or RNA. For most cloning purposes, DNA vectors are preferred. Typical vectors include plasmids,
modified viruses, baculovirus, bacteriophage, cosmids, yeast artificial chromosomes, and other forms of
episomal or integrated DNA that can encode a hCEA protein. It is well within the purview of the skilled
artisan to determine an appropriate vector for a particular gene transfer or other use.

An expression vector containing codon-optimized DNA encoding a hCEA protein may
be used for high-level expression of hCEA in a recombinant host cell. Expression vectors may include,
but are not limited to, cloning vectors, modified cloning vectors, specifically designed plasmids or
viruses. Also, a variety of bacterial expression vectors may be used to express recombinant hCEA in
bacterial cells if desired. In addition, a variety of fungal cell expression vectors may be used to express
recombinant hCEA in fungal cells. Further, a variety of insect cell expression vectors may be used to
express recombinant protein in insect cells.

The present invention also relates to host cells transformed or transfected with vectors
comprising the nucleic acid molecules of the present invention. Recombinant host cells may be
prokaryotic or eukaryotic, including but not limited to, bacteria such as E. coli, fungal cells such as yeast,
mammalian cells including, but not limited to, cell lines of bovine, porcine, monkey and rodent origin;
and insect cells including but not limited to Drosophila and silkworm derived cell lines. Such
recombinant host cells can be cultured under suitable conditions to produce hCEA or a biologically
equivalent form. In a preferred embodiment of the present invention, the host cell is human. As defined
herein, the term "host cell" is not intended to include a host cell in the body of a transgenic human being,
human fetus, or human embryo.

As noted above, an expression vector containing DNA encoding a hCEA protein may be
used for expression of hCEA in a recombinant host cell. Therefore, another aspect of this invention is a
process for expressing a human CEA protein or protein variant in a recombinant host cell, comprising: (a)
introducing a vector comprising a nucleic acid as set forth in SEQ ID NO: 1 or SEQ ID NO: 15 into a
suitable human host cell; and, (b) culturing the host cell under conditions which allow expression of said
human CEA protein or CEA protein variant.

Following expression of hCEA in a host cell, hCEA protein may be recovered to provide
hCEA protein in active form. Several hCEA protein purification procedures are available and suitable for
use. Recombinant hCEA protein may be purified from cell lysates and extracts by various combinations
of, or individual application of, salt fractionation, ion exchange chromatography, size exclusion
chromatography, hydroxylapatite adsorption chromatography and hydrophobic interaction
chromatography. In addition, recombinant hCEA protein can be separated from other cellular proteins by
use of an immunoaffinity column made with monoclonal or polyclonal antibodies specific for full-length
hCEA protein, or polypeptide fragments of hCEA protein.

The nucleic acids of the present invention may be assembled into an expression cassette
which comprises sequences designed to provide for efficient expression of the protein in a human cell.
The cassette preferably contains a full-length codon-optimized hCEA gene, with related transcriptional
and translational control sequences operatively linked to it, such as a promoter, and termination sequences.
In a preferred embodiment, the promoter is the cytomegalovirus promoter without the intron A sequence
(CMV), although those skilled in the art will recognize that any of a number of other known promoters,
such as the strong immunoglobulin promoter or other eukaryotic gene promoters, may be used. A preferred
transcriptional terminator is the bovine growth hormone terminator, although other known transcriptional
terminators may also be used. The combination of the CMV promoter and the BGH terminator is
particularly preferred.

In accordance with this invention, the hCEAopt expression cassette is inserted into a
vector. The vector is preferably an adenoviral vector, although linear DNA linked to a promoter, or other
vectors, such as adeno-associated virus or a modified vaccinia virus, retroviral or lentiviral vector may
also be used.

If the vector chosen is an adenovirus, it is preferred that the vector be a so-called first-generation
adenoviral vector. These adenoviral vectors are characterized by having a non-functional E1
gene region, and preferably a deleted adenoviral E1 gene region. In some embodiments, the expression
cassette is inserted in the position where the adenoviral E1 gene is normally located. In addition, these
vectors optionally have a non-functional or deleted E3 region. It is preferred that the adenovirus genome
used be deleted of both the E1 and E3 regions (ΔE1ΔE3). The adenoviruses can be multiplied in known
cell lines which express the viral E1 gene, such as 293 cells or PER.C6 cells, or in cell lines derived from
293 or PER.C6 cells which are transiently or stably transformed to express an extra protein. For
example, when using constructs that have controlled gene expression, such as a tetracycline regulatable
promoter system, the cell line may express components involved in the regulatory system. One example
of such a cell line is T-Rex-293; others are known in the art.

For convenience in manipulating the adenoviral vector, the adenovirus may be in a
shuttle plasmid form. This invention is also directed to a shuttle plasmid vector which comprises a
plasmid portion and an adenovirus portion, the adenovirus portion comprising an adenoviral genome
which has a deleted E1 and optional E3 deletion, and has an inserted expression cassette comprising
codon-optimized human CEA. In preferred embodiments, there is a restriction site flanking the
adenoviral portion of the plasmid so that the adenoviral vector can easily be removed. The shuttle
plasmid may be replicated in prokaryotic cells or eukaryotic cells.

In a preferred embodiment of the invention, the expression cassette is inserted into the
pMRKAd5-HV0 adenovirus plasmid (see Emini et al., WO 02/22080, which is hereby incorporated by
reference). This plasmid comprises an Ad5 adenoviral genome deleted of the E1 and E3 regions. The
design of the pMRKAd5-HV0 plasmid was improved over prior adenovectors by extending the 5' cis-acting
packaging region further into the E1 gene to incorporate elements found to be important in
optimizing viral packaging, resulting in enhanced virus amplification. Advantageously, this enhanced
adenoviral vector is capable of maintaining genetic stability following high passage propagation.

Standard techniques of molecular biology for preparing and purifying DNA constructs
enable the preparation of the adenoviruses, shuttle plasmids, and DNA immunogens of this invention.

It has been determined in accordance with the present invention that the synthetic cDNA
molecule described herein (SEQ ID NO: 1), which is codon-optimized for high-level expression in a
human cell, is expressed with greater efficiency than the corresponding wild type sequence. Surprisingly,
the codon optimized cDNA of hCEA breaks tolerance to hCEA more efficiently than the wild type
sequence. Additionally, it was shown herein that hCEAopt is more immunogenic than hCEA and is more
efficient in eliciting both cellular and humoral immune responses.

Therefore, the vectors described above may be used in immunogenic compositions and
vaccines for preventing the development of adenocarcinomas associated with aberrant CEA expression
and/or for treating existing cancers. The vectors of the present invention allow for vaccine development
and commercialization by eliminating difficulties with obtaining high expression levels of exogenous
CEA in successfully transformed host organisms. To this end, one aspect of the instant invention is a
method of preventing or treating cancer comprising administering to a mammal a vaccine vector
comprising a synthetic codon-optimized nucleic acid molecule, the synthetic codon-optimized nucleic
acid molecule comprising a sequence of nucleotides that encodes a human CEA protein as set forth in
SEQ ID NO: 2.

In accordance with the method described above, the vaccine vector may be administered
for the treatment or prevention of cancer in any mammal. In a preferred embodiment of the invention, the
mammal is a human.

Further, one of skill in the art may choose any type of vector for use in the treatment and
prevention method described. Preferably, the vector is an adenovirus vector or a plasmid vector. In a
preferred embodiment of the invention, the vector is an adenoviral vector comprising an adenoviral
genome with a deletion in the adenovirus E1 region, and an insert in the adenovirus E1 region, wherein
the insert comprises an expression cassette comprising: (a) a synthetic codon-optimized polynucleotide
encoding a human CEA protein; and (b) a promoter operably linked to the polynucleotide.

The instant invention further relates to an adenovirus vaccine vector comprising an
adenoviral genome with a deletion in the E1 region, and an insert in the E1 region, wherein the insert
comprises an expression cassette comprising: (a) a synthetic codon-optimized polynucleotide encoding a
human CEA protein; and (b) a promoter operably linked to the polynucleotide.

In a preferred embodiment of this aspect of the invention, the adenovirus vector is an Ad 5
vector.

In another preferred embodiment of the invention, the adenovirus vector is an Ad 6
vector.

In yet another preferred embodiment, the adenovirus vector is an Ad 24 vector.

In another aspect, the invention relates to a vaccine plasmid comprising a plasmid portion
and an expression cassette portion, the expression cassette portion comprising: (a) a synthetic
codon-optimized polynucleotide encoding a human CEA protein or variant thereof; and (b) a promoter operably
linked to the polynucleotide.

In some embodiments of this invention, the recombinant adenovirus vaccines disclosed
herein are used in various prime/boost combinations with a plasmid-based polynucleotide vaccine in
order to induce an enhanced immune response. In this case, the two vectors are administered in a "prime
and boost" regimen. For example, the first type of vector is administered, then after a predetermined
amount of time, for example, 2 weeks, 1 month, 2 months, six months, or other appropriate interval, a
second type of vector is administered. Preferably the vectors carry expression cassettes encoding the
same polynucleotide or combination of polynucleotides. In the embodiment where a plasmid DNA is
also used, it is preferred that the vector contain one or more promoters recognized by mammalian or
insect cells. In a preferred embodiment, the plasmid would contain a strong promoter such as, but not
limited to, the CMV promoter. The synthetic human CEA gene or other gene to be expressed would be
linked to such a promoter. An example of such a plasmid would be the mammalian expression plasmid
V1Jns as described (J. Shiver et al. in DNA Vaccines, M. Liu et al. eds., N.Y. Acad. Sci., N.Y., 772:198-208
(1996), which is herein incorporated by reference).

As stated above, an adenoviral vector vaccine and a plasmid vaccine may be
administered to a vertebrate as part of a single therapeutic regime to induce an immune response. To this
end, the present invention relates to a method of protecting a mammal from cancer comprising: (a)
introducing into the mammal a first vector comprising: i) a synthetic codon-optimized polynucleotide
encoding a human CEA protein or human CEA protein variant; and ii) a promoter operably linked to the
polynucleotide; (b) allowing a predetermined amount of time to pass; and (c) introducing into the
mammal a second vector comprising: i) a synthetic codon-optimized polynucleotide encoding a human
CEA protein or human CEA protein variant; and ii) a promoter operably linked to the polynucleotide.

In one embodiment of the method of protection described above, the first vector is a
plasmid and the second vector is an adenovirus vector. In an alternative embodiment, the first vector is
an adenovirus vector and the second vector is a plasmid.

The instant invention further relates to a method of treating a mammal suffering from an
adenocarcinoma comprising: (a) introducing into the mammal a first vector comprising: i) a synthetic
codon-optimized polynucleotide encoding a human CEA protein or human CEA protein variant; and ii) a
promoter operably linked to the polynucleotide; (b) allowing a predetermined amount of time to pass; and
(c) introducing into the mammal a second vector comprising: i) a synthetic codon-optimized
polynucleotide encoding a human CEA protein or human CEA protein variant; and ii) a promoter
operably linked to the polynucleotide.

In one embodiment of the method of treatment described above, the first vector is a
plasmid and the second vector is an adenovirus vector. In an alternative embodiment, the first vector is
an adenovirus vector and the second vector is a plasmid.

The amount of expressible DNA or transcribed RNA to be introduced into a vaccine
recipient will depend partially on the strength of the promoters used and on the immunogenicity of the
expressed gene product. In general, an immunologically or prophylactically effective dose of about 1 ng
to 100 mg, and preferably about 10 µg to 300 µg of a plasmid vaccine vector is administered directly into
muscle tissue. An effective dose for recombinant adenovirus is approximately 10^6 - 10^12 particles and
preferably about 10^7 - 10^11 particles. Subcutaneous injection, intradermal introduction, impression
through the skin, and other modes of administration such as intraperitoneal, intravenous, or inhalation
delivery are also contemplated. It is also contemplated that booster vaccinations may be provided.
Parenteral administration, such as intravenous, intramuscular, subcutaneous or other means of
administration with adjuvants such as interleukin 12 protein, concurrently with or subsequent to
parenteral introduction of the vaccine of this invention is also advantageous.

The vaccine vectors of this invention may be naked, i.e., unassociated with any proteins,
adjuvants or other agents which impact on the recipient's immune system. In this case, it is desirable for
the vaccine vectors to be in a physiologically acceptable solution, such as, but not limited to, sterile saline
or sterile buffered saline. Alternatively, it may be advantageous to administer an immunostimulant, such
as an adjuvant, cytokine, protein, or other carrier with the vaccines or immunogenic compositions of the
present invention. Therefore, this invention includes the use of such immunostimulants in conjunction
with the compositions and methods of the present invention. An immunostimulant, as used herein, refers
to essentially any substance that enhances or potentiates an immune response (antibody and/or
cell-mediated) to an exogenous antigen. Said immunostimulants can be administered in the form of DNA or
protein. Any of a variety of immunostimulants may be employed in conjunction with the vaccines and
immunogenic compositions of the present invention, including, but not limited to: GM-CSF, IFNα,
tetanus toxoid, IL12, B7.1, LFA-3 and ICAM-1. Said immunostimulants are well-known in the art.
Agents which assist in the cellular uptake of DNA, such as, but not limited to calcium ion, may also be
used. These agents are generally referred to as transfection facilitating reagents and pharmaceutically
acceptable carriers. Those of skill in the art will be able to determine the particular immunostimulant or
pharmaceutically acceptable carrier as well as the appropriate time and mode of administration.

All publications mentioned herein are incorporated by reference for the purpose of
describing and disclosing methodologies and materials that might be used in connection with the present
invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate
such disclosure by virtue of prior invention.
Having described preferred embodiments of the invention with reference to the
accompanying drawings, it is to be understood that the invention is not limited to those precise
embodiments, and that various changes and modifications may be effected therein by one skilled in the art
without departing from the scope or spirit of the invention as defined in the appended claims.

The following examples illustrate, but do not limit the invention.

EXAMPLE 1 

Human CEA optimized codon sequence. 

The entire hCEAopt coding sequence was synthesized and assembled by BIONEXUS
(Oakland, CA). The hCEAopt cDNA, which carries an optimized Kozak sequence at its 5'-end, was
constructed using oligonucleotides assembled by PCR. The assembled cDNA was inserted into the
pCR-Blunt vector (Invitrogen, Carlsbad, CA), yielding pCR-hCEAopt. The integrity of the hCEAopt cDNA
was determined by sequencing of both strands.

EXAMPLE 2

Plasmid Constructs and Adenovirus vectors.

pV1J/hCEAopt: Plasmid pCR-hCEAopt was digested with EcoRI for 1 hr at 37°C. The
resulting 2156 bp insert was purified and cloned into the EcoRI site of plasmid pV1JnsB (Montgomery et
al., DNA Cell Biol. 12(9): 777-83 (1993)).

pV1J/hCEA: Plasmid pCI/hCEA (Song et al. Regulation of T-helper-1 versus T-helper-2
activity and enhancement of tumour immunity by combined DNA-based vaccination and nonviral
cytokine gene transfer. Gene Therapy 7: 481-492 (2000)) was digested with EcoRI. The resulting 2109
bp insert was cloned into the EcoRI site of plasmid pV1JnsA (Montgomery et al., supra).

Ad5/hCEAopt: Plasmid pCR-hCEAopt was digested with EcoRI. The resulting 2156 bp
insert was purified and cloned into the EcoRI site of the polyMRK-Ad5 shuttle plasmid (see Emini et al.,
WO 02/22080, which is hereby incorporated by reference).

Ad5/CEA: The shuttle plasmid pMRK-hCEA for generation of Ad5 vector was obtained
by digesting plasmid pDelta1sp1B/hCEA with SspI and EcoRV. The 9.52 kb fragment was then ligated
with a 1272 bp BglII-BamHI restricted, Klenow treated product from plasmid polyMRK. A PacI/StuI
fragment from pMRK-hCEA and pMRK-hCEAopt containing the expression cassette for hCEA and E1
flanking Ad5 regions was recombined to ClaI linearized plasmid pAd5 in BJ5183 E. coli cells. The
resulting plasmids were pAd5-hCEA and pAd5-hCEAopt, respectively. Both plasmids were cut with
PacI to release the Ad inverted terminal repeats (ITRs) and transfected in PER.C6 cells. Ad5 vector
amplification was carried out by serial passage. MRKAd5/hCEA and MRKAd5/hCEAopt were purified
through standard CsCl gradient purification and extensively dialyzed against A105 buffer (5 mM Tris-Cl
pH 8.0, 1 mM MgCl2, 75 mM NaCl, 5% sucrose, 0.005% Tween 20).

EXAMPLE 3

CEA Expression and Detection. 

Expression of hCEA by the plasmid and Ad vectors was monitored by Western blot
analysis. Plasmids were transfected in HeLa cells or PER.C6 cells with Lipofectamine 2000 (Life
Technologies, Carlsbad, CA). Adenovirus infections of PER.C6 cells were performed in serum free
medium for 30 min at 37°C, then fresh medium was added. After 48 hr incubation, whole cell lysates
and culture supernatant were harvested. The CEA protein present in the cell lysates was detected by
Western blot analysis using a rabbit polyclonal antiserum. The protein was detected as a 180-200 kDa
band. The secreted CEA was detected in the cell supernatants and in peripheral blood of injected mice (3
days post injection) using the Direct Elisa CEA Kit (DBC-Diagnostics Biochem Canada Inc., Ontario,
Canada).

EXAMPLE 4

Mice immunization.

Female C57BL/6 mice (H-2b) were purchased from Charles River (Lecco, Italy).
CEAtg mice (H-2b) were provided by J. Primus (Vanderbilt University) and kept in standard conditions.
Fifty micrograms of plasmid DNA were electroinjected in a 50 µl volume in mice quadriceps as
previously described (Rizzuto et al. Proc. Natl. Acad. Sci. USA 96(11): 6417-22 (1999)). Ad injections
were carried out in mice quadriceps in a 50 µl volume. Humoral and cell-mediated immune responses were
analyzed at the indicated time.

EXAMPLE 5

Codon optimized cDNA of hCEA significantly increased hCEA expression.

A synthetic gene of human CEA (hCEAopt) was designed to incorporate human-preferred
(humanized) codons for each amino acid (hereinafter aa) residue. The codon optimized cDNA
was modified to maintain 76.8% nucleotide identity to the original clone (see FIGURE 1). The codon
optimized cDNAs were cloned into the pV1J vectors (Montgomery et al., supra), placing in front a Kozak
optimized sequence (5'-GCCGCCACC-3', SEQ ID NO: 13) and under the control of the human
cytomegalovirus (CMV)/intron A promoter plus the bovine growth hormone (BGH) termination signal.
The construct was named pV1J/hCEAopt (see EXAMPLE 2). Additionally, an Adenovirus type 5 vector
was constructed carrying the hCEAopt sequence flanked by the CMV/intron A promoter and the BGH
termination signal (Ad5/hCEAopt). For comparison, the equivalent plasmid and Ad5 vectors were
constructed carrying the wild type hCEA sequence, yielding pV1J/hCEA and Ad5/hCEA. Similar to those
containing the codon optimized cDNA, these vectors carry the wild type gene under the control of the
CMV/intron A promoter with the BGH termination signal.
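
The design step described above (substituting a human-preferred codon at each residue, then comparing the result with the original clone) can be sketched as follows. This is a minimal illustration only: the codon table lists one frequently used human codon per amino acid and is an assumption, not the table used to design hCEAopt, and the function names and the toy sequence are hypothetical.

```python
# Illustrative sketch only; this is NOT the procedure used to design hCEAopt.
# ASSUMPTION: one frequently used human codon per amino acid (hypothetical table).
HUMAN_PREFERRED = {
    "A": "GCC", "R": "AGA", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}


def back_translate(protein):
    """Encode a protein sequence using the preferred codon for each residue."""
    return "".join(HUMAN_PREFERRED[aa] for aa in protein)


def percent_identity(seq_a, seq_b):
    """Percent of matching positions between two equal-length, aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Toy wild-type fragment (hypothetical) encoding Met-Glu-Ser-Pro.
wild_type = "ATGGAATCTCCC"
optimized = back_translate("MESP")
print(optimized, round(percent_identity(wild_type, optimized), 1))
```

Because synonymous codons differ mostly at the third position, an optimized coding sequence keeps the protein unchanged while its nucleotide identity to the original clone falls below 100%.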

Western blot analysis of HeLa cells transfected with plasmid pV1J/hCEAopt yielded a
protein with large molecular mass (180-200 kDa) that was indistinguishable in size from that detected in
cells transfected with construct pV1J/hCEA. Similarly, no apparent differences could be detected in the
size of the protein detected in PER.C6 cell lysates that had been infected with Ad5/hCEA or
Ad5/hCEAopt (data not shown).

To compare the efficiency of expression of hCEAopt to that of hCEA, groups of 10
C57BL/6 mice were injected into the quadriceps with different doses of the Ad5/hCEAopt vector ranging
from 1x10^7 to 1x10^4 pfu. Three days post injection, CEA protein levels were determined and compared
to those of control groups that had been injected with the same doses of Ad5/hCEA. A sixfold increase in
the geometric mean values of hCEA levels was observed upon injection of 1x10^7 pfu of Ad/hCEAopt
(48.2 µg/l) relative to the Ad5/hCEA injected mice, whereas a tenfold increase in protein level was
observed upon injection of 1x10^6 pfu of the same virus (19.1 µg/l) (FIGURE 2A). In contrast, injection
of lower doses of Ad5/hCEAopt did not result in a substantial increase in circulating CEA levels as
compared to Ad5/hCEA. The enhancement of CEA protein levels was also noted, albeit to a lower
extent, upon electroinjection of 25 or 50 µg of plasmid pV1J/hCEAopt relative to pV1J/hCEA (FIGURE
7B). Thus, these results indicate that, independently of the gene transfer vehicle utilized, the codon
optimized cDNA is expressed with greater efficiency than the corresponding wild type sequence.
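
The comparison above is stated as a fold increase between geometric mean values of two groups. A minimal sketch of that calculation is given below; the function names are hypothetical and the readings are invented placeholders, not data from this study.

```python
import math


def geometric_mean(values):
    """Geometric mean of strictly positive readings."""
    values = list(values)
    if not values or any(v <= 0 for v in values):
        raise ValueError("values must be non-empty and strictly positive")
    return math.exp(sum(math.log(v) for v in values) / len(values))


def fold_increase(test_values, control_values):
    """Ratio of the geometric mean of the test group to that of the control group."""
    return geometric_mean(test_values) / geometric_mean(control_values)


# HYPOTHETICAL serum CEA readings (µg/l) for two groups of mice.
optimized_group = [40.0, 50.0, 62.5]
wild_type_group = [8.0, 10.0, 12.5]
print(fold_increase(optimized_group, wild_type_group))
```

The geometric mean is less sensitive than the arithmetic mean to a single animal with an unusually high reading, which is one reason it is commonly used for skewed measurements such as serum protein levels.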

EXAMPLE 6

IFN-γ ELISPOT Assay.

Ninety-six-well MAIP plates (Millipore, Bedford, MA) were coated with 100 µl/well of
purified rat anti-mouse IFN-γ (IgG1, clone R4-6A2, PharMingen, San Diego, CA) diluted to 2.5 µg/ml in
sterile PBS. After washing with PBS, blocking of plates was carried out with 200 µl/well of R10 medium
for 2 hrs at 37°C.

Splenocytes were obtained by removing the spleen from the euthanized mice in a sterile
manner. Spleen disruption was carried out by grating the dissected spleen on a metal grid. Red blood
cells were removed by osmotic lysis by adding 1 ml of 0.1X PBS to the cell pellet and vortexing no more
than 15 seconds. One ml of 2X PBS was then added and the volume was brought to 4 ml with 1X PBS.
Cells were pelleted by centrifugation at 1200 rpm for 10 min at room temp., and the pellet was
resuspended in 1 ml R10 medium. Viable cells were counted using Türk's staining.
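
The osmotic lysis step relies on briefly exposing the cells to hypotonic buffer and then restoring near-isotonic conditions. The arithmetic can be checked with the short sketch below, which assumes that the volume of the cell pellet itself is negligible; the helper name is hypothetical.

```python
def mix(parts):
    """Combine (volume_ml, strength_x) parts; return (total_volume_ml, strength_x)."""
    total_volume = sum(volume for volume, _ in parts)
    if total_volume <= 0:
        raise ValueError("total volume must be positive")
    solute = sum(volume * strength for volume, strength in parts)
    return total_volume, solute / total_volume


# Step 1: 1 ml of 0.1X PBS (hypotonic lysis), then 1 ml of 2X PBS is added.
volume, strength = mix([(1.0, 0.1), (1.0, 2.0)])
print(volume, strength)

# Step 2: the volume is brought to 4 ml with 1X PBS.
final_volume, final_strength = mix([(volume, strength), (4.0 - volume, 1.0)])
print(final_volume, final_strength)
```

Under these assumptions the buffer returns to roughly 1X after the 2X PBS is added, so the surviving nucleated cells spend only the short vortexing interval in hypotonic solution.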

Splenocytes were plated at 5x10^ and 2x10^ cells/well in duplicate and incubated for 20 
h at 37°C with 1 µg/ml suspension of each peptide. Concanavalin A (ConA) was used as positive internal
control for each mouse at 5 µg/ml. After washing with PBS, 0.05% Tween 20, plates were incubated O/N
at 4°C with 50 µl/well of biotin-conjugated rat anti-mouse IFNγ (Rat IgG1, clone XMG 1.2, PharMingen)
diluted to 1:2500 in Assay buffer. After extensive washing, plates were developed by adding 50 µl/well
NBT/BCIP (Pierce Biotechnology Inc., Rockford, IL) until development of spots was clearly visible.
The reaction was stopped by washing plates thoroughly with distilled water. Plates were air dried and
spots were then counted using an automated ELISPOT reader.
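
Spot counts from duplicate wells are typically reported as spot forming cells (SFC) per 10^6 splenocytes. The sketch below shows one way such a normalization could be done; the function names, the optional background subtraction, and all numbers are assumptions for illustration and are not taken from this protocol.

```python
def mean_of_replicates(spot_counts):
    """Average the spot counts of replicate wells."""
    spot_counts = list(spot_counts)
    if not spot_counts:
        raise ValueError("at least one well is required")
    return sum(spot_counts) / len(spot_counts)


def sfc_per_million(spot_count, cells_per_well, background_spots=0.0):
    """Scale a per-well spot count to spots per 10^6 plated cells.

    ASSUMPTION: background (e.g. medium-only wells) is subtracted and the
    result is floored at zero; the source does not describe this step.
    """
    if cells_per_well <= 0:
        raise ValueError("cells_per_well must be positive")
    net_spots = max(spot_count - background_spots, 0.0)
    return net_spots * 1_000_000 / cells_per_well


# HYPOTHETICAL duplicate wells and plating density.
duplicate_wells = [48, 52]
print(sfc_per_million(mean_of_replicates(duplicate_wells), cells_per_well=500_000))
```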

EXAMPLE 7

Intracellular Cytokine Staining.

One to two million mouse splenocytes or PBMC in 1 ml RPMI 10% FCS were incubated
with a pool of peptides (5-6 µg/ml final concentration of each peptide) and brefeldin A (1 µg/ml; BD
Pharmingen cat #555028/2300kk) in 5% CO2 for 12-16 hours at 37°C. Cells were then washed with
FACS buffer (PBS 1% FBS, 0.01% NaN3) and incubated with purified anti-mouse CD16/CD32 Fc block
(BD Pharmingen cat #553142) for 15 min at 4°C. Cells were then washed and stained with surface
antibodies: PE-conjugated anti-mouse CD4 (BD Pharmingen cat #553049), PerCP-conjugated anti-mouse
CD8 (BD Pharmingen cat #553036) and APC-conjugated anti-mouse CD3e (BD Pharmingen cat
#553066) for 30 minutes at room temperature in the dark. After washing, cells were fixed and
permeabilized with Cytofix-Cytoperm Solution (BD Pharmingen cat #555028/2300kk) for 20 min at 4°C
in the dark. After washing with PermWash Solution (BD Pharmingen cat #555028/2300kk), cells were
incubated with the IFNγ-FITC antibodies (BD Pharmingen). Cells were then washed, fixed with
1% formaldehyde in PBS and analyzed on a FACS-Calibur flow cytometer, using CellQuest software
(Becton Dickinson, San Jose, CA).

EXAMPLE 8

Identification and characterization of epitope containing peptides for direct enumeration of CEA-specific
T cells

To better characterize the immune response elicited upon genetic vaccination against
CEA in mice, ELISPOT analysis was carried out on C57BL/6 and BALB/c mice to identify CD4+ and
CD8+ CEA specific epitopes. To this end, different immunization modalities were compared to generate
highly immunized mice that could be utilized to identify responses to individual peptides that cover the
entire protein. In view of recent reports that indicate that high levels of cellular immunity can be induced
against viral and bacterial antigens by utilizing a plasmid DNA prime-Ad boost modality, the same
immunization protocol was employed in this study. Mice were immunized intramuscularly by different
regimens: i) two doses of 1x10^9 vp of Ad/hCEA (Ad/Ad), ii) two doses of plasmid pV1J/hCEA
(DNA/DNA) and iii) a dose of plasmid DNA followed by Ad/hCEA (DNA/Ad). The immunizations
were two weeks apart.

The cellular immunity elicited by the different immunization regimes was measured by
ELISPOT assay 2 weeks after the boost. To compare the immunogenic efficiency of the different
vaccination regimens, a pool of 15-mer peptides overlapping by 11 aa and covering aa 497-703 (pool D)
was used to stimulate antigen specific cytokine secretion from splenocytes. The most vigorous
responses, indicated by the higher geometric mean values of the SFC, were observed in C57BL/6 and
BALB/c mice from the DNA/Ad injected group (FIGURE 4). Thus, this regimen was utilized to further
analyze the immune response.

To determine whether the immune response was equally distributed across the entire
CEA protein, splenocytes from immunized C57BL/6 and BALB/c mice were stimulated in vitro with one
of four pools of 15-mer peptides that collectively encompass the entire protein sequence. Each pool
consisted of peptides 15 amino acids long that overlap by 11 residues. Lyophilized hCEA peptides were
purchased from Bio-Synthesis (Lewisville, TX) and resuspended in DMSO at 40 mg/ml. In addition to
pool D, pools A (aa 1 to 147), B (aa 137 to 237), and C (aa 317 to 507) were used in this study. Final
concentrations were the following: pool A = 1.2 mg/ml, pool B = 0.89 mg/ml, pool C = 0.89 mg/ml, pool
D = 0.8 mg/ml. Peptides were stored at -80°C.
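
A library of 15-mers that overlap by 11 residues corresponds to a window that advances 4 residues at a time. The sketch below generates such a tiling; the function name and the toy sequence are hypothetical, and residues beyond the last full-length window are simply not covered in this simplified version.

```python
def tile_peptides(protein, length=15, overlap=11):
    """Return (start, end, sequence) tuples with 1-based inclusive coordinates."""
    if not 0 <= overlap < length:
        raise ValueError("overlap must be non-negative and smaller than length")
    if len(protein) < length:
        return []
    step = length - overlap  # 4 residues for 15-mers overlapping by 11
    peptides = []
    for start in range(0, len(protein) - length + 1, step):
        peptides.append((start + 1, start + length, protein[start:start + length]))
    return peptides


# Toy 23-residue sequence (hypothetical, not hCEA).
toy_protein = "ACDEFGHIKLMNPQRSTVWYACD"
for start, end, sequence in tile_peptides(toy_protein):
    print(start, end, sequence)
```

If the peptides are numbered consecutively from residue 1 (an assumption), peptide n begins at residue 4(n-1)+1, so peptide 80 would begin at residue 317, which is consistent with the start of pool C given above.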

The immune response elicited by the DNA/Ad vaccination regimen in C57BL/6 mice
was primarily biased towards the C-terminal region of the protein (see FIGURE 5A). Significant SFC
values were obtained with peptide pools C and D (geometric mean values: 170 and 244 SFC/10^6
splenocytes, respectively), whereas pools A and B yielded much lower values (10 and 27 SFC/10^6
splenocytes, respectively). In contrast, the immune response in BALB/c mice was highest with pool B
(geometric mean value: 1236 SFC/10^6 splenocytes), although pools A, C, and D showed significant SFC
values (93, 263, and 344 respectively) (FIGURE 5B). No responses against a pool of unrelated peptides
were noted in either group of mice (data not shown).

To identify the individual peptides present in the peptide pools that elicit the responses,
spleens from 4 mice immunized with the DNA/Ad vaccination regimen were analyzed in an
IFNγ-ELISPOT assay against each of the individual peptides comprising the pools against which a significant
immune response had been observed. Splenocytes from C57BL/6 mice were tested against peptides 80 to
173 included in pools C and D. Splenocytes from BALB/c mice were tested against peptides 35 to 173
that comprise pools B, C, and D. CEA specific responses in C57BL/6 mice were mapped to four pairs of
15-mer peptides that had overlapping sequences (aa 421 to 435 and 425 to 439; 529 to 543 and 533 to
547; 565 to 579 and 569 to 583; 613 to 627 and 617 to 631) (FIGURE 6A). The immune response to
CEA in BALB/c mice was mapped to 22 different peptides, 17 of which have overlapping sequences (aa
213 to 227, and 213 and 227; 229 to 243, and 233 to 247; 409 to 423 and 413 to 427; 421 to 435 and 425
to 439; 565 to 579 and 569 to 583; 573 to 587; 613 to 627 and 617 to 631, and 621 to 635 and 625 to 639;
637 to 651 and 641 to 655) (FIGURE 6B).

To define the T-cell specificity of the epitopes contained within the selected peptides,
an IFNγ intracellular staining assay was carried out on splenocytes from injected mice. The results obtained
are shown in FIGURE 7. The data indicate that CD8+ and CD4+ specific epitopes have been identified
for both C57BL/6 and BALB/c which can be used to quantify circulating levels of T-lymphocytes.

EXAMPLE 9

Codon optimized hCEA cDNA breaks tolerance in hCEA transgenic mice.

To determine whether the enhanced immunogenic properties of the codon optimized
cDNA of hCEA would break tolerance to human CEA more efficiently, hCEA transgenic mice were
immunized with vectors carrying either the wild type or the codon optimized hCEA sequences. These
transgenic mice carry the entire human CEA gene plus flanking sequences and express the hCEA protein
in the cecum and colon. Thus, this mouse line is a useful model for studying the safety and efficacy of
immunotherapy strategies directed against this tumor self antigen (Clarke et al. Mice transgenic for
human CEA as a model for immunotherapy. Cancer Res. 58(7): 1469-77 (1998)).

As a first test, groups of 5 to 10 transgenic mice were subjected to four electroinjections
of 50 µg plasmid DNA followed by a final injection of 1x10^10 vp of Adenovirus. The immune response
to hCEA was analyzed by IFNγ-ELISPOT assay on pooled splenocytes from 4 injected mice. The
immune response to hCEA was detected only with the splenocytes from the mice immunized with the
hCEAopt cDNA (see FIGURE 8). The immune response was detected with peptide 143 and pool D,
suggesting that immunization had elicited a significant CD8+ response to the C-terminal epitopes.

The enhanced immunogenicity of the codon optimized cDNA of hCEA was also tested in
transgenic mice using two injections of 1x10¹⁰ pp of Adenovirus vectors two weeks apart. The
CEA-specific immune response was measured by IFNγ intracellular staining on pooled PBMC from 4
immunized mice. The immune response to hCEA was detected only in mice immunized with Ad/CEAopt
(FIGURE 9). As observed with the DNA plus Ad cohort, induction of CD8+ T-cells was detected with
peptide pool D; however, a significant CD8+ response was also noted with peptide pool A. Thus, these
results indicate that the codon optimized cDNA of hCEA is more immunogenic and breaks tolerance to
hCEA more efficiently than the wild type sequence.

EXAMPLE 10

Antibody Detection and Titration.

Sera for antibody titration were obtained by retro-orbital bleeding. ELISA plates (Nunc
maxisorp™) were coated with 100 ng/well of CEA protein (highly pure CEA; Fitzgerald Industries
International Inc., Concord MA), diluted in coating buffer (50 mM NaHCO3, pH 9.4), and incubated O/N
at 4°C. Plates were then blocked with PBS containing 5% BSA for 1 hr at 37°C. Mouse sera were diluted
in PBS 5% BSA (dilution 1/50 to evaluate seroconversion rate; dilutions from 1:10 to 1:31,2150 to
evaluate titer). Pre-immune sera were used as background. Diluted sera were incubated O/N at 4°C.
Washes were carried out with PBS 1% BSA, 0.05% Tween 20. Secondary antibody (goat anti-mouse
IgG Peroxidase, Sigma) was diluted 1/2000 in PBS, 5% BSA and incubated 2-3 hr at room temp. on a
shaker. After washing, plates were developed with 100 µl/well of TMB substrate (Pierce Biotechnology,
Inc., Rockford, IL). The reaction was stopped with 25 µl/well of 1M H2SO4 solution and plates were
read at 450 nm/620 nm. Anti-CEA serum titers were calculated as the reciprocal limiting dilution of
serum producing an absorbance at least 3-fold greater than the absorbance of autologous pre-immune
serum at the same dilution.
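
The titer rule stated above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `endpoint_titer`, the 5-fold dilution steps and all absorbance readings are hypothetical and are not taken from the study. The sketch also reads "limiting dilution" as the highest dilution that still meets the 3-fold criterion.

```python
def endpoint_titer(immune_od, preimmune_od, fold=3.0):
    """Reciprocal of the highest dilution whose absorbance is at least
    `fold` times the pre-immune absorbance at the same dilution.

    Both arguments map a reciprocal dilution (e.g. 250 for 1:250) to an
    absorbance reading. Returns None if no dilution meets the criterion.
    """
    positive = [
        dilution
        for dilution, od in immune_od.items()
        if dilution in preimmune_od and od >= fold * preimmune_od[dilution]
    ]
    return max(positive) if positive else None


# Hypothetical readings, invented for illustration only (not study data).
immune = {10: 2.10, 50: 1.60, 250: 0.90, 1250: 0.35, 6250: 0.12}
preimmune = {10: 0.20, 50: 0.10, 250: 0.08, 1250: 0.07, 6250: 0.06}

print(endpoint_titer(immune, preimmune))  # 1250 for these made-up readings
```

With these invented readings the 1:6250 well falls below three times its pre-immune control, so the reported titer is 1250.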


EXAMPLE 11 

Increased Immunogenicity of hCEAopt

To examine in vivo immune responses induced by the wild type and codon optimized
CEA expression vectors, C57BL/6 mice were immunized intramuscularly with different doses of
Ad5/hCEAopt ranging from 1x10⁵ to 1x10³ pfu. As comparison, groups of 8 to 10 mice were
immunized with Ad5/hCEA in doses ranging from 1x10⁶ to 1x10⁴ pfu. Mice were subjected to two
injections three weeks apart. Two weeks after the second immunization, splenocytes were isolated from
each mouse. To quantify the IFNγ-secreting CEA-specific CD8+ T-cell precursor frequencies generated by
the Adenovirus mediated immunization, the ELISPOT assay for the H-2b restricted T-cell epitope
CGIQNSVSA (SEQ ID NO: 14, see below) was used. Immunization with 1x10⁴ pfu elicited a
measurable immune response yielding 53 IFNγ spot forming cells (SFC, geometric mean value) specific
for the CGIQNSVSA epitope (SEQ ID NO: 14), whereas injection of 1x10³ pfu elicited negligible SFC
values (FIGURE 3A). The SFC increased to 302 in the group immunized with 1x10⁵ pfu of
Ad/hCEAopt. In contrast, at least 1x10⁵ pfu of Ad5/hCEA were necessary to elicit a significant CD8+
T-cell precursor frequency, which increased to 168 SFC in the mouse group immunized with a dose of 1x10⁶
pfu. No peptide-specific IFNγ SFC were detected in the Ad5 immunized mice (data not shown).

Sera from mice immunized with 1x10⁵ pfu of each hCEA Adenovirus vector were tested
in ELISA using the purified human CEA protein as substrate (FIGURE 3B). CEA-specific antibody titer
in Ad5/hCEAopt immunized mice was detected in all immunized mice and the geometric mean value of
the Ab titer was 46,474. In contrast, the Ad5/hCEA immunized group showed an approximately 100 fold
lower geometric mean titer of CEA-specific antibody (454). Thus, these results demonstrate that the
codon optimized cDNA of CEA is more efficient in eliciting cellular and humoral immune responses.
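
The "approximately 100 fold" figure follows directly from the two geometric mean titers quoted above. The short Python sketch below reproduces that arithmetic and shows how a geometric mean is formed. The helper name `geometric_mean` and the three individual titers are illustrative assumptions. Only the two group means (46,474 and 454) come from the text.

```python
import math


def geometric_mean(values):
    """Geometric mean of strictly positive values, computed in log space."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("geometric mean needs a non-empty list of positive values")
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Geometric mean titers reported in the text for the two vector groups.
gmt_opt = 46474  # Ad5/hCEAopt
gmt_wt = 454     # Ad5/hCEA
fold = gmt_opt / gmt_wt
print(round(fold, 1))  # 102.4

# Hypothetical individual titers (invented for illustration only).
example_titers = [1250, 6250, 31250]
print(round(geometric_mean(example_titers)))  # 6250
```

Because titers span orders of magnitude, the geometric mean is the usual summary. It equals the antilog of the mean log titer, so a single very high responder does not dominate the group value.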

EXAMPLE 12 

Statistical Analysis.

Where indicated, results were analyzed by the Student t test. A p value < 0.05 was
considered significant.
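
As a rough illustration of the test named above, the Python sketch below computes an unpaired Student t statistic and a two-sided p value using only the standard library. The text does not say whether the test was paired or unpaired, one- or two-sided, so the unpaired, equal-variance, two-sided form is an assumption here. The function names and both sample groups are hypothetical.

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coeff = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coeff * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p(t, df, steps=20000):
    """Two-sided p value via Simpson integration of the density on [0, |t|]."""
    upper = abs(t)
    if upper == 0:
        return 1.0
    h = upper / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central = total * h / 3  # P(0 <= T <= |t|)
    return max(0.0, 1.0 - 2.0 * central)


def student_t_test(a, b):
    """Unpaired, equal-variance Student t test; returns (t, df, p)."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, df, two_sided_p(t, df)


# Hypothetical measurements for two groups (invented, not study data).
group_a = [5.1, 4.9, 5.6, 5.3, 5.0]
group_b = [4.1, 4.4, 3.9, 4.3, 4.0]

t, df, p = student_t_test(group_a, group_b)
print(f"t = {t:.3f}, df = {df}, significant at 0.05: {p < 0.05}")
```

In practice a statistics package would normally be used for the p value. The numerical integration is included only to keep the sketch self-contained.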
WHAT IS CLAIMED IS:

1. A synthetic nucleic acid molecule comprising a sequence of nucleotides that
encodes a human carcinoembryonic antigen (CEA) protein as set forth in SEQ ID NO: 2, the synthetic
nucleic acid molecule being codon-optimized for high level expression in a human cell.

2. The synthetic nucleic acid molecule of claim 1 wherein the nucleic acid is DNA.

3. The synthetic nucleic acid molecule of claim 1 wherein the nucleic acid is mRNA.

4. The synthetic nucleic acid molecule of claim 1 wherein the nucleic acid is cDNA.

5. The synthetic nucleic acid molecule of claim 1 wherein the sequence of
nucleotides comprises the sequence of nucleotides set forth in SEQ ID NO: 1.

6. A vector comprising the nucleic acid molecule of claim 1.

7. A host cell comprising the vector of claim 6.

8. A process for expressing a human carcinoembryonic antigen (CEA) protein in a
recombinant host cell, comprising:

(a) introducing a vector comprising the nucleic acid of claim 1 into a suitable host
cell; and,

(b) culturing the host cell under conditions which allow expression of said human
CEA protein.

9. A method of preventing or treating cancer comprising administering to a
mammal a vaccine vector comprising a synthetic codon-optimized nucleic acid molecule, the nucleic acid
molecule comprising a sequence of nucleotides that encodes a human carcinoembryonic antigen (hCEA)
protein as set forth in SEQ ID NO: 2 or a CEA protein variant as set forth in SEQ ID NO: 16.

10. A method according to claim 9 wherein the mammal is human.

11. A method according to claim 9 wherein the vector is an adenovirus vector or a
plasmid vector.

12. A method according to claim 9 wherein the vector is an adenoviral vector
comprising an adenoviral genome with a deletion in the adenovirus E1 region, and an insert in the
adenovirus E1 region, wherein the insert comprises an expression cassette comprising:

(a) a codon-optimized polynucleotide encoding a human CEA protein or variant
thereof; and

(b) a promoter operably linked to the polynucleotide.

13. A method according to claim 9 wherein the vector is a plasmid vaccine vector,
which comprises a plasmid portion and an expressible cassette comprising

(a) a codon-optimized polynucleotide encoding a human CEA protein or variant
thereof; and

(b) a promoter operably linked to the polynucleotide.

14. An adenovirus vaccine vector comprising an adenoviral genome with a deletion
in the E1 region, and an insert in the E1 region, wherein the insert comprises an expression cassette
comprising:

(a) a codon-optimized polynucleotide encoding a human CEA protein or variant
thereof; and

(b) a promoter operably linked to the polynucleotide.

15. An adenovirus vector according to claim 14 which is an Ad 5 vector.

16. An adenovirus vector according to claim 14 which is an Ad 6 vector.

17. An adenovirus vector according to claim 14 which is an Ad 24 vector.

18. A vaccine plasmid comprising a plasmid portion and an expression cassette
portion, the expression cassette portion comprising:

(a) a codon-optimized polynucleotide encoding a human CEA protein or variant
thereof; and

(b) a promoter operably linked to the polynucleotide.

19. A method of protecting a mammal from cancer comprising:

(a) introducing into the mammal a first vector comprising:

(i) a codon-optimized polynucleotide encoding a human carcinoembryonic
antigen (CEA) protein or variant thereof; and

(ii) a promoter operably linked to the polynucleotide;

(b) allowing a predetermined amount of time to pass; and

(c) introducing into the mammal a second vector comprising:

(i) a codon-optimized polynucleotide encoding a human CEA protein or
variant thereof; and

(ii) a promoter operably linked to the polynucleotide.

20. A method according to claim 19 wherein the first vector is a plasmid and the
second vector is an adenovirus vector.

21. A method according to claim 19 wherein the first vector is an adenovirus vector
and the second vector is a plasmid.

22. A method of treating a mammal suffering from a colorectal carcinoma
comprising:

(a) introducing into the mammal a first vector comprising:

(i) a codon-optimized polynucleotide encoding a human CEA protein or
variant thereof; and

(ii) a promoter operably linked to the polynucleotide;

(b) allowing a predetermined amount of time to pass; and

(c) introducing into the mammal a second vector comprising:

(i) a codon-optimized polynucleotide encoding a human CEA protein or
variant thereof; and

(ii) a promoter operably linked to the polynucleotide.

23. A method according to claim 22 wherein the first vector is a plasmid and the
second vector is an adenovirus vector.

24. A method according to claim 22 wherein the first vector is an adenovirus vector
and the second vector is a plasmid.

25. A synthetic nucleic acid molecule comprising a sequence of nucleotides that
encodes a human carcinoembryonic antigen (CEA) protein variant as set forth in SEQ ID NO: 16, the
synthetic nucleic acid molecule being codon-optimized for high level expression in a human cell.

26. The synthetic nucleic acid molecule of claim 25 wherein the sequence of
nucleotides comprises the sequence of nucleotides set forth in SEQ ID NO: 15.

27. A vector comprising the nucleic acid molecule of claim 25.

28. A host cell comprising the vector of claim 27.

FIGURE 1. Human Codon-Optimized CEA Sequence

1 MetGluSerProSerMaProProHisArgTrpCysIleProTrpGlxiArgLeuLeuLeu 2 0 

1 atggagtctccctcggcccctccccacagatggtgcatcccctggk::agaggctcctgctc 6 0 

AGC. . .AGO C .C.C C.C. .G G 

21 ThrAlaSerLeuLeuThrPheTrpAsnProProThrThrAlaLysLeuThrlleGluSer 40 

6 1 ACAGCCTCACTTCTAACCTTCTGGAACCCGCCCyiCCACTGCCAAGCTCACTAOT^ 12 0 

..C...AGC. .G. .G C C G.. C.C.. GAG. 

41 ThrProPheAsnValAlaGluGlyLysGluValLeiiLeiiLeuValHisAsnLeuProGln 60 

121 ACGCCGTTCAATGTCGCAGAGGGGAAGGAGGTGCTTCTACTTGTCCACAATCT^ 180 

..C.C C .G. .C C G. .G. .G- .G C 

61 HisLeuPheGlyTyrSerTrpTyrLysGlyGliiArgValAspGlyAsnArgGlnllelle 80 

181 CATCTTTTTGGCTACAGCTGGTACAAAGGTGAAAGAGTGGATGGCAACCGTCAAATTAT^ 240 

. .C. .G. .C G. .C. .GCC C C .G. .C. .C 

81 GlyTyrVallleGlyThrGlnGlnAlaThrProGlyProAlaTyrSerGlyArgGluIle 100 

241 GGATATGTAATAGGAACTCAACAAGCTACCCCAGGGCCCGCATACAGTGGTCGAGAGATA 300 

. .C .C .G. .C .C .C .G. .G. .C C .C C C .C .C C 

101 ileTyrProAsnAlaSerLeiiLeuIleGlriAsnllelleGlnAsnAspThrGlyPheTyr 120 

301 ATATACCCCAATGCATCCCTGCTGATCCAGAACATCATCCAGAATGACACAGGATTCTAC 360 

. .C C .CAG C C .C 

121 ThrLetoHi sVall leLy sSerAspLeuValAsnGluGluAlaThrGlyGlnPheArgVal 140 
3 61 ACCCTACACGTCATAAAGTCAGATCTTGTGAATGAAG7UVGCAACTGGCCAGTTCCGGGTA 420 
G G..C. ,AGC .C .G C .G. .G- .C. .C C .G 

141 TyrProGluLeuProLysProSerlleSerSerAsnAsnSerLysProValGliiAspLys 160 
421 TACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAAACCCGTGGAGGACAAG 480 
C AG AG AG. . .G. . 

161 AspAlaValAlaPheThrCy sGluProGluThrGliiAspAlaThrTyrLeuTrpTrpVal 180 

481 GATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACGCAACCTACCTGTGGTGGGTA 540 

. .C .C C .G. .C C C G 

181 AsnAsnGlnSerLeuProValSerProArgLeuGlnLeuSerAsnGlyAsnArgThrLeu 200 
541 AACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGCAACAGGACCCTC 600 
C G..C.G..C..CC AG. . .C CC G 

201 ThrLeuPheAsnValThrArgAsnAspThrAlaSerTyrLysCysGluThrGlnAsnPro 22 0 

601 ACTCTATTCAATGTCACAAGAAATGACACAGCAAGCTACAAATGTGAAACCCAGAACCCA 660 

. .C .G C .G. .CCC .C C .C G. .C .G .C 

221 ValSerAlaArgArgSerAspSerVallleLeviAsnValLeuTyrGlyProAspAlaPro 240 
661 GTGAGTGCCAGGCGCAGTGATTCAGTCATCCTGAATGTCCTCTATGGCCCGGATGCCCCC 720 
C . .CC C .CAGC .6 C .G. .G. .C C .C 

241 ThrlleSerProLeuAsnThrSerTyrArgSerGlyGluAsiiLeuAsnLeuSer^ 260 
721 ACCATTTCCCCTCTAAACACATCTTACAGATCAGGGGAAAATCTGAACCTCTCCTGCCAC 780 
CAG. . .C .G CAGC . .CCAGC .C. .G. .C GAG 

261 AlaAlaSerAsnProProAlaGlnTyrSerTrpPheValAsnGlyThrPheGlnGlnSer 280 

781 GCAGCCTCTT^CCCACCTGCACAGTACTCTTGGTTTGTCAATGGGACTTTC 840 

. .C - - AGC C .C .C AGC C -G. .C .C .C GAG. 

281 ThrGlnGluLeuPheIleProAsnIleThrValAsnAsnSerGlySerTyrThrCysGln 300
841 ACCCAAGAGCTCTTTATCCCCAACATCACTGTGAATAATAGTGGATCCTATACGTGCCAA 900
G G. .C C C. .C. .C- .CAG. . .C. .C G 

301 AlaHi s AsnSerAspThrGlyLexiAsnArgThrThrValThrThr I leThrVal^ 320 
901 GCCCATAACTCAGACACTGGCCTCAATAGGACCACAGTCACGACGATCTICAGTCTA 960 
C. , .AGO C G. .CC.C C. .G. .C. -C C. .G. .C- -C 

321 GluProProLysProPhelleThrSerAsnAsnSerAsnProValGluAspGluAspAla 340 
961 GAGCCACCCAAACCCTTCATCACCAGCAACAACTCCAACCCCGTGGAGGATGAGGAT^ 1020 
C G AG C C. .C 

341 ValAlaLeuThrCysGluProGluIleGliiAsnThrThrTyrLeuTrpTrpValAsnAsn 3 60 
1021 GTAGCCTTAACCTGTGAACCTGAGATTCAGAACACAACCTACCTGTGGTGGGTAAATAAT 1080 
. .G. • -C^G C..G..C C C G. .C. .C 

361 GlnSerLeuProValSerProArgLeuGlnLeuSerAsnAspAsnArgThrLeuThrLeu 380 
1081 CAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGACAACAGGACCCTCACTCTA 1140 
G- .C. .G. .C. - .C.C AG. . -C C.C G. .C. .G 

381 IieuSerValThrArgAsnAspValGlyProTyrGluCysGlylleGlnAsnGluLeuSer 400 
1141 CTCAGTGTCACAAGGAATGATGTAGGACCCTATGAGTGTGGAATCCAGAACGAATTAAGT 1200 
. .G. .C. -G. .CC.C. .C. -C- .G. .C C C. .C GC.G. .C 

401 ValAspHisSerAspProVallleLeiiAsnValLeuTyrGlyProAspAspProThrlle 42 0 
1201 GTTGACCACAGCGACCCAGTCATCCTGAATGTCCTCTATGGCCCAGACGACCCCACCATT 1260 
. .G C. .G C. -G. .G. -C C . . .C 

421 SerProSerTyrThrTyrTyrArgProGlyValAsnLeuSerLeuSerCysHisAlaAla 440 
1261 TCCCCCTCATACACCTATTACCGTCCAGGGGTGAACCTCAGCCTCTCCTGCCATGCAGCC 1320 
AG AGC C C. .C. .C G GAG C.C 

441 SerAsnProProAlaGlnTyr SerTrpLeuIleAspGlyAsnlleGlnGlnHi sThrGln 460 
1321 TCTAACCCACCTGCACAGTATTCTTGGCTGATTGATGGGAACATCCAGCAACACACAC^ 1380 
AGC C. .C. .C CAGC C. .C. .C G C. .G 

461 GliiLeuPhel leSerAsnlleThrGluLysAsnSerGlyLeuTyrThrCysGliiAlaAsii 480 
1381 GAGCTCTTTATCTCCAACATCACTGAGAAGAACAGCGGACTCTATACCTGCCAGGCCAAT 1440 
G. .C. . .AG C C. .G. .C C 

481 AsnSerAlaSerGlyHisSerArgThrThrValLysThrlleThrValSerAlaGluLeu 500 
1441 AACTCAGCCAGTGGCCACAGCAGGACTACAGTCAAGACAATCACAGTCTCTGCGGAGCTG 1500 
. . .AGC C C.C. .C. .C. .G C C- .GAGC. .C 

501 ProLy s ProSer I leSer SerAsnAsnSerLy s ProValGliiAspLy s AspAl aVal Ala 520 
1501 CCCAAGCCCTCCATCTCCAGCAACAACTCCAAACCCGTGGAGGACAAGGATG^ 1560 
AG AG AG. . .G C. .C 

521 PheThrCysGluProGliiAlaGlnAsnThrThrTyrLeuTrpTrpValAsnGlyGlnSer 540 
1561 TTCACCTGTGAACCTGAGGCTCAGAACACAACCTACCTGTGGTGGGTAAATGGTCAGAGC 1620 
C. .G. .C C C G. .C. .C 

541 LeuProValSerProArgLeuGlnLeuSerAsnGlyAsnArgThrLeuThrLeuPheAsn 560 
1621 CTCCCAGTCAGTCCCAGGCTGCAGCTGTCCAATGGCAACAGGACCCTCACTCTATTCAAT 1680 
. .G. .C. .G. .C. . .C.C AG. . .C C.C G. .C. .G C 

561 ValThrArgAsnAspAlaArgAlaTyrValCysGlyl leGlnAsnSerValSerAlaAsn 580 
1681 GTCACAAGAAAl^CGCAAGAGCCTATGTATGTGGAATCCAGAACTCAGTGAGTGCAAAC 1740 
. .G. .CC.C. .C CC.C C. .G. .C. .C AGC C. .C. . . 



WO 2004/099247   PCT/EP2004/004802



581 ArgSerAspProValThrLeuAspValLeuTyrGlyProAspThr Prol lelleSer Pro 600 
1741 CGCAGTGACCCAGTCy^CCTGGATGTCCTCTATGGGCCGGACACCCCC^^ 1800 
C C. .G C. .G. .G- .C. .C. .C CAG 

601 ProAspSerSerTyrLeuSerGlyAlaAsnLexiAsnLeuSerCysHisSerAlaSerAsn 620 
1801 CCAGACTCGTCTTACCTTTCGGGAGCGAACCTCAACCTCTCCTGCCACTCGGCCTCTAAC 1860 
. .C. . .AGCAGC GAGC. .C. .C G GAG AGO- • -AGC. . , 

621 ProSer ProGlnTyr SerTrpArgI leAsnGlyl leProGlnGlnHi sThrGlnValLeu 640 
1861 CCATCCCCGCAGTATTCTTGGCGTATCAATGGGATACCGCAGCAACACACACAAGTTCTC 1920 
. .CAG. . .C CAGC C C. .C. .C. .C G C. .G. .G. .G 

641 PhelleAlaLysIleThrProAsnAsnAsnGlyThrTyrAlaCysPheValSerAsnlieu 660 
1921 TTTATCGCCAAAATCACGCCAAATAATAACGGGACCTATGCCTGTTTTGTC 1980 
. .C G C. .C. .C- .C C C C. .C. .GAGC. . .C. . 

661 AlaThrGlyArgAsnAsnSerlleValLysSerlleThrValSerAlaSerGlyThrSer 680 
1981 GCTACTGGCCGCAATAATTCCATAGTCAAGAGCATCACAGTCTCTGCATCTGGAACTTCT 2040 
_C. .C C. .CAG. ..C..G C. .GAGC. .CAGC. .C. .CAGC 

681 ProGlyLeuSerAlaGlyAlaThrValGlylleMetlleGlyValLeuValGlyValAla 700 
2041 CCTGGTCTCTCAGCTGGGGCCACTGTCGGCATCATGATTGGAGTGCTGGTTGGGGTTGCT 2100 
. .C. .C. -GAGC. .C. .C C. .G C. .C G. .C. .G. .C 

701 LeuIleEnd 703 (SEQ ID NO: 2) 
2101 CTGATATAG 2109 (SEQ ID NO: 3) 
C.GA (SEQ ID NO: 1)

FIGURE 3A. Cell-Mediated Immune Response to Human CEA
FIGURE 3B. Humoral Immune Response to Human CEA
FIGURE 4A. Comparison of Immunization Regimens Used for C57BL/6 Mice
FIGURE 4B. Comparison of Immunization Regimens Used for BALB/c Mice
FIGURE 5A. T-Cell Responses in C57BL/6 Mice
FIGURE 5B. T-Cell Responses in BALB/c Mice
FIGURE 6A. Immunoresponsive Peptides of hCEA in Immunized C57BL/6 Mice
FIGURE 6B. Immunoresponsive Peptides of hCEA in Immunized BALB/c Mice
FIGURE 7. Sequence of Epitope-Containing Peptides for CEA



A. C57BL/6

Pool  Peptide  Position  Sequence         SEQ ID NO.  CD8   CD4
C     CEA-107  425-439   TYYRPGVNLSLSCHA  4           0.02  0.14
D     CEA-133  529-543   NTTYLWWVNGQSLPV  5           0.01  0.04
D     CEA-143  569-583   YVCGIQNSVSANRSD  6           0.64  0.01
D     CEA-155  617-631   SASNPSPQYSWRING  7           0.03  0.06
      DMSO                                            0.01  0.01

B. BALB/c

Pool  Peptide  Position  Sequence         SEQ ID NO.  CD8   CD4
B     CEA-58   229-243   VILNVLYGPDAPTIS  8           0.51  0.01
C     CEA-98   389-403   GPYECGIQNELSVDH  9           0.01  0.14
C     CEA-106  421-435   SPSYTYYRPGVNLSL  10          0.01  0.16
D     CEA-156  621-635   PSPQYSWRINGIPQQ  11          0.01  0.12
D     CEA-167  665-679   NNSIVKSITVSASGT  12          0.13  0.03
      DMSO                                            0.01  0.01
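
The peptide numbers and positions listed in FIGURE 7 are consistent with a library of 15-mers in which each peptide starts four residues after the previous one (for example, CEA-106 at 421-435 and CEA-107 at 425-439). The Python sketch below encodes that relationship. The 4-residue step is inferred from the table rather than stated in this section, and the function names are hypothetical. The protein fragment is residues 551 to 600 copied from FIGURE 10B.

```python
PEPTIDE_LENGTH = 15
STEP = 4  # assumed offset between consecutive peptides, inferred from FIGURE 7


def peptide_span(number):
    """1-based (start, end) residue positions of peptide CEA-<number>."""
    start = STEP * (number - 1) + 1
    return start, start + PEPTIDE_LENGTH - 1


# Residues 551-600 of the CEA sequence, copied from FIGURE 10B.
FRAGMENT_START = 551
FRAGMENT = "NGNRTLTLFNVTRNDARAYVCGIQNSVSANRSDPVTLDVLYGPDTPIISP"


def peptide_sequence(number):
    """Sequence of a peptide that lies entirely inside FRAGMENT."""
    start, end = peptide_span(number)
    lo = start - FRAGMENT_START
    if lo < 0 or end - FRAGMENT_START >= len(FRAGMENT):
        raise ValueError("peptide lies outside the fragment used in this sketch")
    return FRAGMENT[lo:lo + PEPTIDE_LENGTH]


print(peptide_span(143))      # (569, 583)
print(peptide_sequence(143))  # YVCGIQNSVSANRSD
```

The sequence recovered for CEA-143 contains the nonamer CGIQNSVSA, the H-2b restricted epitope used for the ELISPOT assay in Example 11.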






FIGURE 8. IFNγ-ELISPOT Analysis of Immunized CEA.Tg Mice
FIGURE 9. IFNγ Intracellular Staining of Immunized CEA.Tg Mice
FIGURE 10A. Nucleotide Sequence of Human CEA-AAD

1 ATGGAGAGCC CCAGCGCCCC CCCCCACCGC TGGTGCATCC CCTGGCAGCG 
CCTGCTGCTG ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG 
101 CCAAGCTGAC CATCGAGAGC ACCCCCTTCA ACGTGGCCGA GGGCAAGGAG 
GTGCTGCTGC TGGTGCACAA CCTGCCCCAG CACCTGTTCG GCTACAGCTG 
201 GTACAAGGGC GAGCGCGTGG ACGGCAACCG CCAGATCATC GGCTACGTGA 
TCGGCACCCA GCAGGCCACC CCCGGCCCCG CCTACAGCGG CCGCGAGATC 
301 ATCTACCCCA ACGCCAGCCT GCTGATCCAG AACATCATCC AGAACGACAC
CGGCTTCTAC ACCCTGCACG TGATCAAGAG CGACCTGGTG AACGAGGAGG 
401 CCACCGGCCA GTTCCGCGTG TACCCCGAGC TGCCCAAGCC CAGCATCAGC 
AGCAACAACA GCAAGCCCGT GGAGGACAAG GACGCCGTGG CCTTCACCTG 
501 CGAGCCCGAG ACCCAGGACG CCACCTACCT GTGGTGGGTG AACAACCAGA 
GCCTGCCCGT GAGCCCCCGC CTGCAGCTGA GCAACGGCAA CCGCACCCTG 
601 ACCCTGTTCA ACGTGACCCG CAACGACACC GCCAGCTACA AGTGCGAGAC 
CCAGAACCCC GTGAGCGCCC GCCGCAGCGA CAGCGTGATC CTGAACGTGC 
701 TGTACGGCCC CGACGCCCCC ACCATCAGCC CCCTGAACAC CAGCTACCGC
AGCGGCGAGA ACCTGAACCT GAGCTGCCAC GCCGCCAGCA ACCCCCCCGC 
801 CCAGTACAGC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC ACCCAGGAGC 
TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CACCTGCCAG 
901 GCCCACAACA GCGACACCGG CCTGAACCGC ACCACCGTGA CCACCATCAC 
CGTGTACGCC GAGCCCCCCA AGCCCTTCAT CACCAGCAAC AACAGCAACC 
1001 CCGTGGAGGA CGAGGACGCC GTGGCCCTGA CCTGCGAGCC CGAGATCCAG 
AACACCACCT ACCTGTGGTG GGTGAACAAC CAGAGCCTGC CCGTGAGCCC 
1101 CCGCCTGCAG CTGAGCAACG ACAACCGCAC CCTGACCCTG CTGAGCGTGA 
CCCGCAACGA CGTGGGCCCC TACGAGTGCG GCATCCAGAA CGAGCTGAGC 
1201 GTGGACCACA GCGACCCCGT GATCCTGAAC GTGCTGTACG GCCCCGACGA 
CCCCACCATC AGCCCCAGCT ACACCTACTA CCGCCCCGGC GTGAACCTGA 
1301 GCCTGAGCTG CCACGCCGCC AGCAACCCCC CCGCCCAGTA CAGCTGGCTG 
ATCGACGGCA ACATCCAGCA GCACACCCAG GAGCTGTTCA TCAGCAACAT 
1401 CACCGAGAAG AACAGCGGCC TGTACACCTG CCAGGCCAAC AACAGCGCCA 
GCGGCCACAG CCGCACCACC GTGAAGACCA TCACCGTGAG CGCCGAGCTG
1501 CCCAAGCCCA GCATCAGCAG CAACAACAGC AAGCCCGTGG AGGACAAGGA 
CGCCGTGGCC TTCACCTGCG AGCCCGAGGC CCAGAACACC ACCTACCTGT 
1601 GGTGGGTGAA CGGCCAGAGC CTGCCCGTGA GCCCCCGCCT GCAGCTGAGC 
AACGGCAACC GCACCCTGAC CCTGTTCAAC GTGACCCGCA ACGACGCCCG 
1701 CGCCTACGTG TGCGGCATCC AGAACAGCGT GAGCGCCAAC CGCAGCGACC 
CCGTGACCCT GGACGTGCTG TACGGCCCCG ACACCCCCAT CATCAGCCCC
1801 CCCGACAGCA GCTACCTGAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG
CGCCAGCAAC CCCAGCCCCC AGTACAGCTG GCGCATCAAC GGCATCCCCC 
1901 AGCAGCACAC CCAGGTGCTG TTCATCGCCA AGATCACCCC CAACAACAAC 
GGCACCTACG CCTGCTTCGT GAGCAACCTG GCCACCGGCC GCAACAACAG 
2001 CATCGTGAAG AGCATCACCG TGAGCGCCAG CGGC (SEQ ID NO: 15)

FIGURE 10B. Amino Acid Sequence of Human CEA-AAD

1 MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE 

51 VLLLVHNLPQ HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI 

101 IYPNASLLIQ NIIQNDTGFY TLHVIKSDLV NEEATGQFRV YPELPKPSIS

151 SNNSKPVEDK DAVAFTCEPE TQDATYLWWV NNQSLPVSPR LQLSNGNRTL 

201 TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP TISPLNTSYR 

251 SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 

301 AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ 

351 NTTYLWWVNN QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS 

401 VDHSDPVILN VLYGPDDPTI SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL 

451 IDGNIQQHTQ ELFISNITEK NSGLYTCQAN NSASGHSRTT VKTITVSAEL 

501 PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS LPVSPRLQLS

551 NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 

601 PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN 

651 GTYACFVSNL ATGRNNSIVK SITVSASG (SEQ ID NO: 16)

SEQUENCE LISTING

<110> Istituto Di Ricerche Di Biologia Molecolare P. Angeletti S.P.A.
La Monica, Nicola
Mennuni, Carmela
Savino, Rocco
Leihin, Arxciin 

<120> SYNTHETIC GENE ENCODING HUMAN 

CARCINOEMBRYONIC ANTIGEN AND USES THEREOF 



<130> ITR0044Y 

<150> 60/467,971 
<151> 2003-05-05 

<150> Unknown 
<151> 2004-02-11 

<160> 16 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 2109 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> hCEAopt 
<400> 1 

atggagagcc ccagcgcccc cccccaccgc tggtgcatcc cctggcagcg cctgctgctg 60
accgccagcc tgctgacctt ctggaacccc cccaccaccg ccaagctgac catcgagagc 120
acccccttca acgtggccga gggcaaggag gtgctgctgc tggtgcacaa cctgccccag 180
cacctgttcg gctacagctg gtacaagggc gagcgcgtgg acggcaaccg ccagatcatc 240
ggctacgtga tcggcaccca gcaggccacc cccggccccg cctacagcgg ccgcgagatc 300
atctacccca acgccagcct gctgatccag aacatcatcc agaacgacac cggcttctac 360
accctgcacg tgatcaagag cgacctggtg aacgaggagg ccaccggcca gttccgcgtg 420
taccccgagc tgcccaagcc cagcatcagc agcaacaaca gcaagcccgt ggaggacaag 480
gacgccgtgg ccttcacctg cgagcccgag acccaggacg ccacctacct gtggtgggtg 540
aacaaccaga gcctgcccgt gagcccccgc ctgcagctga gcaacggcaa ccgcaccctg 600
accctgttca acgtgacccg caacgacacc gccagctaca agtgcgagac ccagaacccc 660
gtgagcgccc gccgcagcga cagcgtgatc ctgaacgtgc tgtacggccc cgacgccccc 720
accatcagcc ccctgaacac cagctaccgc agcggcgaga acctgaacct gagctgccac 780
gccgccagca acccccccgc ccagtacagc tggttcgtga acggcacctt ccagcagagc 840
acccaggagc tgttcatccc caacatcacc gtgaacaaca gcggcagcta cacctgccag 900
gcccacaaca gcgacaccgg cctgaaccgc accaccgtga ccaccatcac cgtgtacgcc 960
gagcccccca agcccttcat caccagcaac aacagcaacc ccgtggagga cgaggacgcc 1020
gtggccctga cctgcgagcc cgagatccag aacaccacct acctgtggtg ggtgaacaac 1080
cagagcctgc ccgtgagccc ccgcctgcag ctgagcaacg acaaccgcac cctgaccctg 1140
ctgagcgtga cccgcaacga cgtgggcccc tacgagtgcg gcatccagaa cgagctgagc 1200
gtggaccaca gcgaccccgt gatcctgaac gtgctgtacg gccccgacga ccccaccatc 1260
agccccagct acacctacta ccgccccggc gtgaacctga gcctgagctg ccacgccgcc 1320
agcaaccccc ccgcccagta cagctggctg atcgacggca acatccagca gcacacccag 1380
gagctgttca tcagcaacat caccgagaag aacagcggcc tgtacacctg ccaggccaac 1440
aacagcgcca gcggccacag ccgcaccacc gtgaagacca tcaccgtgag cgccgagctg 1500
cccaagccca gcatcagcag caacaacagc aagcccgtgg aggacaagga cgccgtggcc 1560
ttcacctgcg agcccgaggc ccagaacacc acctacctgt ggtgggtgaa cggccagagc 1620
ctgcccgtga gcccccgcct gcagctgagc aacggcaacc gcaccctgac cctgttcaac 1680
gtgacccgca acgacgcccg cgcctacgtg tgcggcatcc agaacagcgt gagcgccaac 1740
cgcagcgacc ccgtgaccct ggacgtgctg tacggccccg acacccccat catcagcccc 1800
cccgacagca gctacctgag cggcgccaac ctgaacctga gctgccacag cgccagcaac 1860
cccagccccc agtacagctg gcgcatcaac ggcatccccc agcagcacac ccaggtgctg 1920
ttcatcgcca agatcacccc caacaacaac ggcacctacg cctgcttcgt gagcaacctg 1980
gccaccggcc gcaacaacag catcgtgaag agcatcaccg tgagcgccag cggcaccagc 2040
cccggcctga gcgccggcgc caccgtgggc atcatgatcg gcgtgctggt gggcgtggcc 2100
ctgatctga 2109

<210> 2
<211> 702
<212> PRT
<213> Homo sapiens

<400> 2

Met Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro Trp Gln
1               5                   10                  15
Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr
            20                  25                  30
Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu Gly
        35                  40                  45
Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe Gly
    50                  55                  60
Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile Ile
65                  70                  75                  80
Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr Ser
                85                  90                  95
Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn Ile
            100                 105                 110
Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser Asp
        115                 120                 125
Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro Glu Leu
    130                 135                 140
Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val Glu Asp Lys
145                 150                 155                 160
Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Thr Gln Asp Ala Thr Tyr
                165                 170                 175
Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg Leu Gln
            180                 185                 190
Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr Arg Asn
        195                 200                 205
Asp Thr Ala Ser Tyr Lys Cys Glu Thr Gln Asn Pro Val Ser Ala Arg
    210                 215                 220
Arg Ser Asp Ser Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro
225                 230                 235                 240
Thr Ile Ser Pro Leu Asn Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn
                245                 250                 255
Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp Phe
            260                 265                 270
Val Asn Gly Thr Phe Gln Gln Ser Thr Gln Glu Leu Phe Ile Pro Asn
        275                 280                 285
Ile Thr Val Asn Asn Ser Gly Ser Tyr Thr Cys Gln Ala His Asn Ser
    290                 295                 300
Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr Ile Thr Val Tyr Ala
305                 310                 315                 320
Glu Pro Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu
                325                 330                 335
Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr
            340                 345                 350
Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg
        355                 360                 365
Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr
    370                 375                 380
Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Glu Leu Ser
385                 390                 395                 400
Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp
                405                 410                 415
Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn
            420                 425                 430
Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser
        435                 440                 445
Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
    450                 455                 460
Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn
465                 470                 475                 480
Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val
                485                 490                 495
Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro
            500                 505                 510
Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln
        515                 520                 525
Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser
    530                 535                 540
Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn
545                 550                 555                 560
Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser
                565                 570                 575
Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly
            580                 585                 590
Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly
        595                 600                 605
Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln
    610                 615                 620
Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu
625                 630                 635                 640
Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe
                645                 650                 655
Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile
            660                 665                 670
Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr
        675                 680                 685
Val Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile
    690                 695                 700

<210> 3

<211> 2109 

<212> DNA 

<213> Homo sapiens 

<400> 3 

atggagtctc cctcggcccc tccccacaga tggtgcatcc cctggcagag gctcctgctc 60 
acagcctcac ttctaacctt ctggaacccg cccaccactg ccaagctcac tattgaatcc 120 
acgccgttca atgtcgcaga ggggaaggag gtgcttctac ttgtccacaa tctgccccag 180 
catctttttg gctacagctg gtacaaaggt gaaagagtgg atggcaaccg tcaaattata 240 
ggatatgtaa taggaactca acaagctacc ccagggcccg catacagtgg tcgagagata 300 
atatacccca atgcatccct gctgatccag aacatcatcc agaatgacac aggattctac 360 
accctacacg tcataaagtc agatcttgtg aatgaagaag caactggcca gttccgggta 420 
tacccggagc tgcccaagcc ctccatctcc agcaacaact ccaaacccgt ggaggacaag 480
gatgctgtgg ccttcacctg tgaacctgag actcaggacg caacctacct gtggtgggta 540 
aacaatcaga gcctcccggt cagtcccagg ctgcagctgt ccaatggcaa caggaccctc 600 
actctattca atgtcacaag aaatgacaca gcaagctaca aatgtgaaac ccagaaccca 660 
gtgagtgcca ggcgcagtga ttcagtcatc ctgaatgtcc tctatggccc ggatgccccc 720 
accatttccc ctctaaacac atcttacaga tcaggggaaa atctgaacct ctcctgccac 780 
gcagcctcta acccacctgc acagtactct tggtttgtca atgggacttt ccagcaatcc 840 
acccaagagc tctttatccc caacatcact gtgaataata gtggatccta tacgtgccaa 900 
gcccataact cagacactgg cctcaatagg accacagtca cgacgatcac agtctatgca 960 
gagccaccca aacccttcat caccagcaac aactccaacc ccgtggagga tgaggatgct 1020 
gtagccttaa cctgtgaacc tgagattcag aacacaacct acctgtggtg ggtaaataat 1080 
cagagcctcc cggtcagtcc caggctgcag ctgtccaatg acaacaggac cctcactcta 1140 
ctcagtgtca caaggaatga tgtaggaccc tatgagtgtg gaatccagaa cgaattaagt 1200 
gttgaccaca gcgacccagt catcctgaat gtcctctatg gcccagacga ccccaccatt 1260 
tccccctcat acacctatta ccgtccaggg gtgaacctca gcctctcctg ccatgcagcc 1320 
tctaacccac ctgcacagta ttcttggctg attgatggga acatccagca acacacacaa 1380 
gagctcttta tctccaacat cactgagaag aacagcggac tctatacctg ccaggccaat 1440 
aactcagcca gtggccacag caggactaca gtcaagacaa tcacagtctc tgcggagctg 1500 
cccaagccct ccatctccag caacaactcc aaacccgtgg aggacaagga tgctgtggcc 1560 
ttcacctgtg aacctgaggc tcagaacaca acctacctgt ggtgggtaaa tggtcagagc 1620 
ctcccagtca gtcccaggct gcagctgtcc aatggcaaca ggaccctcac tctattcaat 1680 
gtcacaagaa atgacgcaag agcctatgta tgtggaatcc agaactcagt gagtgcaaac 1740 
cgcagtgacc cagtcaccct ggatgtcctc tatgggccgg acacccccat catttccccc 1800 
ccagactcgt cttacctttc gggagcgaac ctcaacctct cctgccactc ggcctctaac 1860 
ccatccccgc agtattcttg gcgtatcaat gggataccgc agcaacacac acaagttctc 1920 
tttatcgcca aaatcacgcc aaataataac gggacctatg cctgttttgt ctctaacttg 1980 
gctactggcc gcaataattc catagtcaag agcatcacag tctctgcatc tggaacttct 2040 
cctggtctct cagctggggc cactgtcggc atcatgattg gagtgctggt tggggttgct 2100 
ctgatatag 2109 

<210> 4 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 4 

Thr Tyr Tyr Arg Pro Gly Val Asn Leu Ser Leu Ser Cys His Ala 
1 5 10 15 



<210> 5 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 5 

Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val 
1 5 10 15 



<210> 6 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 6 

Tyr Val Cys Gly Ile Gln Asn Ser Val Ser Ala Asn Arg Ser Asp 
1 5 10 15 



<210> 7 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 7 

Ser Ala Ser Asn Pro Ser Pro Gln Tyr Ser Trp Arg Ile Asn Gly 
1 5 10 15 



<210> 8 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 8 

Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro Thr Ile Ser 
1 5 10 15 
<210> 9 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 9 

Gly Pro Tyr Glu Cys Gly Ile Gln Asn Glu Leu Ser Val Asp His 
1 5 10 15 



<210> 10 
<211> 15 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 10 

Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn Leu Ser Leu 
1 5 10 15 



<210> 11 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide 
<400> 11 

Pro Ser Pro Gln Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln 
1 5 10 15 



<210> 12 
<211> 15 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> peptide 
<400> 12 

Asn Asn Ser Ile Val Lys Ser Ile Thr Val Ser Ala Ser Gly Thr 
1 5 10 15 



<210> 13 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Kozak sequence 

<400> 13 
gccgccacc 

<210> 14 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> T-cell epitope 
<400> 14 

Cys Gly Ile Gln Asn Ser Val Ser Ala 
1 5 



<210> 15 
<211> 2034 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> hCEA-AAD 
<400> 15 

atggagagcc ccagcgcccc cccccaccgc tggtgcatcc cctggcagcg cctgctgctg 60 
accgccagcc tgctgacctt ctggaacccc cccaccaccg ccaagctgac catcgagagc 120 
acccccttca acgtggccga gggcaaggag gtgctgctgc tggtgcacaa cctgccccag 180 
cacctgttcg gctacagctg gtacaagggc gagcgcgtgg acggcaaccg ccagatcatc 240 
ggctacgtga tcggcaccca gcaggccacc cccggccccg cctacagcgg ccgcgagatc 300 
atctacccca acgccagcct gctgatccag aacatcatcc agaacgacac cggcttctac 360 
accctgcacg tgatcaagag cgacctggtg aacgaggagg ccaccggcca gttccgcgtg 420 
taccccgagc tgcccaagcc cagcatcagc agcaacaaca gcaagcccgt ggaggacaag 480 
gacgccgtgg ccttcacctg cgagcccgag acccaggacg ccacctacct gtggtgggtg 540 
aacaaccaga gcctgcccgt gagcccccgc ctgcagctga gcaacggcaa ccgcaccctg 600 
accctgttca acgtgacccg caacgacacc gccagctaca agtgcgagac ccagaacccc 660 
gtgagcgccc gccgcagcga cagcgtgatc ctgaacgtgc tgtacggccc cgacgccccc 720 
accatcagcc ccctgaacac cagctaccgc agcggcgaga acctgaacct gagctgccac 780 
gccgccagca acccccccgc ccagtacagc tggttcgtga acggcacctt ccagcagagc 840 
acccaggagc tgttcatccc caacatcacc gtgaacaaca gcggcagcta cacctgccag 900 
gcccacaaca gcgacaccgg cctgaaccgc accaccgtga ccaccatcac cgtgtacgcc 960 
gagcccccca agcccttcat caccagcaac aacagcaacc ccgtggagga cgaggacgcc 1020 
gtggccctga cctgcgagcc cgagatccag aacaccacct acctgtggtg ggtgaacaac 1080 
cagagcctgc ccgtgagccc ccgcctgcag ctgagcaacg acaaccgcac cctgaccctg 1140 
ctgagcgtga cccgcaacga cgtgggcccc tacgagtgcg gcatccagaa cgagctgagc 1200 
gtggaccaca gcgaccccgt gatcctgaac gtgctgtacg gccccgacga ccccaccatc 1260 
agccccagct acacctacta ccgccccggc gtgaacctga gcctgagctg ccacgccgcc 1320 
agcaaccccc ccgcccagta cagctggctg atcgacggca acatccagca gcacacccag 1380 
gagctgttca tcagcaacat caccgagaag aacagcggcc tgtacacctg ccaggccaac 1440 
aacagcgcca gcggccacag ccgcaccacc gtgaagacca tcaccgtgag cgccgagctg 1500 
cccaagccca gcatcagcag caacaacagc aagcccgtgg aggacaagga cgccgtggcc 1560 
ttcacctgcg agcccgaggc ccagaacacc acctacctgt ggtgggtgaa cggccagagc 1620 
ctgcccgtga gcccccgcct gcagctgagc aacggcaacc gcaccctgac cctgttcaac 1680 
gtgacccgca acgacgcccg cgcctacgtg tgcggcatcc agaacagcgt gagcgccaac 1740 
cgcagcgacc ccgtgaccct ggacgtgctg tacggccccg acacccccat catcagcccc 1800 
cccgacagca gctacctgag cggcgccaac ctgaacctga gctgccacag cgccagcaac 1860 
cccagccccc agtacagctg gcgcatcaac ggcatccccc agcagcacac ccaggtgctg 1920 
ttcatcgcca agatcacccc caacaacaac ggcacctacg cctgcttcgt gagcaacctg 1980 
gccaccggcc gcaacaacag catcgtgaag agcatcaccg tgagcgccag cggc 2034 

<210> 16 

<211> 678 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> hCEA-AAD 
<400> 16 

Met Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro Trp Gln 

1 5 10 15 

Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr 

20 25 30 

Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu Gly 

35 40 45 

Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe Gly 

50 55 60 

Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile Ile 
65 70 75 80 

Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr Ser 

85 90 95 

Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn Ile 

100 105 110 

Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser Asp 

115 120 125 

Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro Glu Leu 

130 135 140 

Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val Glu Asp Lys 

145 150 155 160 

Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Thr Gln Asp Ala Thr Tyr 

165 170 175 

Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg Leu Gln 

180 185 190 

Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr Arg Asn 

195 200 205 

Asp Thr Ala Ser Tyr Lys Cys Glu Thr Gln Asn Pro Val Ser Ala Arg 

210 215 220 

Arg Ser Asp Ser Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro 
225 230 235 240 

Thr Ile Ser Pro Leu Asn Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn 

245 250 255 

Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp Phe 

260 265 270 

Val Asn Gly Thr Phe Gln Gln Ser Thr Gln Glu Leu Phe Ile Pro Asn 

275 280 285 

Ile Thr Val Asn Asn Ser Gly Ser Tyr Thr Cys Gln Ala His Asn Ser 

290 295 300 

Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr Ile Thr Val Tyr Ala 
305 310 315 320 

Glu Pro Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu 

325 330 335 

Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr 

340 345 350 

Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg 

355 360 365 

Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr 

370 375 380 

Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Glu Leu Ser 
385 390 395 400 

Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp 

405 410 415 

Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn 

420 425 430 

Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser 

435 440 445 

Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile 

450 455 460 

Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn 
465 470 475 480 

Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val 

485 490 495 

Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro 

500 505 510 

Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln 

515 520 525 

Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser 

530 535 540 

Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn 
545 550 555 560 

Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser 

565 570 575 

Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly 

580 585 590 

Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly 

595 600 605 

Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln 

610 615 620 

Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu 
625 630 635 640 

Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe 

645 650 655 

Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile 

660 665 670 

Thr Val Ser Ala Ser Gly 
675 